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COMPOSITIONS THAT INHIBIT PROLIFERATION OF CANCER 

CELLS 

This application claims priority to United States Provisional Application No. 60/239,705 
filed on October 12, 2000, entitled "Agents promoting apoptosis in cancer cells via interruption of 
5 oncogene-induced integrin signaling," which application is herein incorporated by reference in its 
entirety and to United States Provisional Application No. 60/242,812 filed on October 24, 2000 
entitled, "Agents promoting apoptosis in cancer cells via interruption of oncogene-induced integrin 
signaling", and which application is herein incorporated by reference in its entirety. 

I. BACKGROUND OF THE INVENTION 

10 In cancer cells multiple oncogenic lesions cooperate in malignant transformation. Such 

cooperation permits survival and proliferation of tumor cells in absence of contact with extra- 
cellular matrix (ECM), suggesting that tumor cell survival and proliferation have become 
independent of the engagement of integrin signaling by ECM. 

Carcinogenesis is caused by multiple cooperating genetic lesions leading to a progressive 
1 5 deregulation of cellular signaling and cell cycle restriction point control. The mutations involved 
result in oncogene activation or loss of tumor-suppressor gene function. Typically, single 
oncogenes are insufficient to cause malignant transformation because they simultaneously induce 
signals stimulating and inhibiting cell growth. As a result cell proliferation remains restricted. In 
contrast, cooperating oncogenic lesions act in concert to disable such inhibitory signals while 
20 reinforcing the growth-promoting stimuli. The co-operation of oncogenic lesions involves 
integration of multiple signals converging on the regulation of cell cycle-dependent kinase 
complexes (Lloyd et al., 1997; Perez-Roger et al. $ 1999; Roper etal, 2001; Sewing etal., 1997). 

Disclosed herein are compositions and methods that show survival of various transformed 
cell types requires cell-autonomous (autocrine) integrin signaling activity. This activity is induced 
25 by cooperating oncogenic lesions and involves induction of integrin receptor and Iigand 

components such as integrin alpha6 and integrin beta4, and Iaminin5-gamma2 chains. Blocking of 
integrin or the laminin ligand function induces rapid apoptosis of the transformed cells, even when 
growing in presence of ECM. In contrast, normal cells remain viable when exposed to the same 
treatment. 

30 The disclosed compositions and methods are related to the cooperation of oncogenic 

lesions controlling the ability of transformed cells to proliferate in the absence of contact with the 
extra-cellular matrix (ECM). As taught herein, oncogenes cooperate to promote a cell-autonomous 
(autocrine) integrin signaling loop that proves essential for the survival of various transformed cell 
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types. As this signaling loop is not established in corresponding normal cells, the signaling 
components of this loop constitute attractive targets for cancer therapy. 

II. SUMMARY OF THE INVENTION 

In accordance with the purposes of this invention, as embodied and broadly described 
5 herein, this invention, in one aspect, relates to compositions and methods related to integrin 
mediated cancer cell growth. 

Additional advantages of the invention will be set forth in part in the description which 
follows, and in part will be obvious from the description, or may be learned by practice of the 
invention. The advantages of the invention will be realized and attained by means of the elements 
10 and combinations particularly pointed out in the appended claims. It is to be understood that both 
the foregoing general description and the following detailed description are exemplary and 
explanatory only and are not restrictive of the invention, as claimed. 

in. BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and constitute a part of this 
15 specification, illustrate several embodiments of the invention and together with the description, 
serve to explain the principles of the invention. 

Figure 1 shows a series of schematics representative of the disclosed relationships and 
compositions. In normal cells integrin receptors signal to suppress programmed cell death 
(apoptosis) when engaged by appropriate extra-cellular matrix (ECM) ligands. When receptor- 

20 ligand interaction is lost, the cells undergo apoptosis due to the lack of survival signals (Figure 1-. 
Panel 1). In cancer cells multiple oncogenic lesions cooperate to cause malignant transformation. 
Such cooperation permits survival and proliferation of tumor cells independent of integrin 
receptor-ECM interactions. This property has been termed anchorage-independence (Panel 2). 
We have discovered the mechanistic basis of anchorage-independence. The transformed cells 

25 replace the requirement for ECM-dependent signaling with a surrogate integrin signaling loop on 
which they rely for survival. In colonic epithelial cells, activation of Ras in combination with APC 
(adenomatous polyposis coli) or p53 mutations leads to induction of integrin receptor and ligand 
components such as integrin alpha6 and laminin gamma2 chains. As a consequence, laminin- 
dependent activation of alpha6/beta4 integrin receptors signals to inhibit caspase activity and thus 

30 to suppress apoptosis (Panel 3)» Ablation of integrin alpha6, laminin ligand function or 

alpha6fteta4 integrin receptor function induces apoptosis of the transformed cells, even when 
growing in the presence of ECM. In contrast, non-transformed control cells remain viable when 
exposed to the same treatment, indicating that the dependence of the transformed ceils on 
autocrine integrin signaling may be a particular feature of the cancer cell phenotype. The essential 

35 role of alpha6 integrin extends to various transformed cell types including mesenchymal and 
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SW480 human colon carcinoma cells. Thus inhibition of laminin/alpha6 integrin-mediated 
signaling is an important method to induce cancer cell-specific death in a variety of cell types 
(Panel 4). 

Figure 2 shows that oncogene cooperation protects cells from apoptosis. Figure 2A shows 
5 control, APCm, Ras, APCm+Ras cells that were detached from collagen IV substrates with 
Trypsin/EDTA and kept in suspension at 2x10 s cells/ml in RPMI 10%FCS for 12 h at 39°C. 
Subsequently TUNEL analysis was performed on poly-lysine treated slides. The percentage of 
tunel-positive cells was determined by immunofluorescence microscopy. Figure 2B shows 
control, APCm, Ras, APCm Ras, SW480 cells were detached and maintained in suspension as 
10 described in (A). SW480 beta4 dn/Gal4VPER cells, express a 40H-tamoxifen-inducible 
dominant-negative form of the beta4 integrin. Cells were pelleted and protein extracts were 
prepared in 300 \xl of 50 mM Tris-HCl, pH 7,4; 1% NP40 ; 0.25% sodium deoxycholate; 150 mM 
NaCl ; 1 mM EGTA. 200 ul of extracts were incubated for 10 min with 2 ^1 of Caspase 3 
fluorometric Substrate (Upstate Biotechnology). Vmax of caspase activity was determined by 
15 measuring the fluorescence at 460 nm after excitation at 380 nm for I h. Figure 2C shows laminin 
gamma 1 and gamma2-specific peptides were added to APCmin+Ras cells at the concentration of 
lOO^g/ml. Caspase activity was measured as in (B). Figure 2D shows caspase 8 and caspase 9 
activities were measured as in (B) using caspase 8 and caspase 9-specific fluorimetric substrates 
(Upstate Biotechnologies). 

20 Figure 3 shows alterations of integrin and laminin expression profiles in malignant cell 

transformation. Figure 3 A shows integrin expression and figure 3B shows laminin expression. 
The indicated cell populations were cultured on collagen IV-coated dishes at 39°C in RPMI 10% 
FCS. Total RNA was extracted from 10 6 cells for each sample and used for RT-PCR(laminins, 
alpha4, gamma2 and GAPDH) or RNase protection. For RT-PCR, cDNA was subjected to 28 

25 cycles (linear range) of PCR amplification. PCR products were analyzed on a 2% agarose gel. For 
RNase protection, 10 ng of total RNA was used per reaction. Products were resolved on a 4.5% 
polyacrylamide/lOM urea gel. 

Figure 4 shows alpha6/beta4 integrin is engaged by the laminin garnma2 chain to activate 
She (a sre homology domain containing protein). Figure 4A shows cells detached from collagen 

30 IV coated dishes with 3mM EDTA (in PBS) were incubated on ice with the indicated antibodies 
for 1 h. Cells were then plated on 96well dishes coated with gammal or gamma2-specific peptides 
and were permitted to attach for 30 min in RPMI medium at 39°C. After incubation, wells were 
washed with RPMI. The percentage of attached cells per well was measured by hexoaminidase 
activity after lysis of the cells in the well and incubation with the substrate p-nitrophenol-N-acetyl- 

35 p-D-glucosaminide for 5 h at 37°C Figure 4B shows cells indicated were detached as in (A) and 
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resuspended at 10 7 cells/ml in PBS. After 1 h of incubation on ice, cells were pelleted by 
centrifugation 5 min at 900 rpm and resuspended in RPMI containing the phosphatase inhibitor 
pervanadate at 10|iM* The cells were incubated with betai and beta4 integrin antibodies as well as 
the laminin gamma2-specific peptide for 40 min. Protein extracts were prepared and subjected to 
5 an immuno-precipitation with an anti-She antibody. Phospho-tyrosine was detected with a 
phopho-tyrosine-specific antibody in IP-wcstem-blots (upper panels). The levels of She protein 
were monitored with a She-specific antibody reusing the same membrane (lower panels). 

Figure 5 shows integrin alpha6/beta4 and Laminin gamma2 chain expression is essential 
for survival of transformed cells. Figure 5A1 shows the indicated cell populations were infected at 

10 an MOl of 2 with recombinant retroviruses expressing anti-sense RNA for alpha6 integrin, the 
gamma2 laminin chain or a beta4 integrin dominant-negative mutant together with a puromycin 
resistance gene. After 2 weeks of selection with puromycin, colonies were stained with Giemsa 
and counted. APCm+Ras alpha6 and APCm+Ras p35 cells express the alpha6 integrin chain and 
the anti-apoptotic baculovirus p35, respectively. APCm+Ras cells were also plated on dishes pre- 

15 coated with a laminin gamma2-specific peptide. Figure 5A2 shows APCm+Ras hygro, 

APCm+Ras Bcl2 and APCm+Ras p35 cells express the hygromycin resistance marker, and the 
anti-apoptotic proteins Bcl2 or p35, respectively. The cells were infected at an MOI of 2 with 
retroviruses carrying the beta4 integrin dominant-negative mutant and a puromycin resistance 
gene, or the puromycin resistance gene alone. Cells (10 5 ) were maintained in soft agar at 39°C for 

20 two weeks after which macroscopically visible colonies were counted. Figure 5A3 shows the 
indicated cell populations were infected at an MOI of 2 with recombinant retroviruses, drug- 
selected and maintained as described in Al. Figure 5A4 shows the indicated cell populations were 
infected at an MOI of 2 with recombinant retroviruses expressing anti-sense RNA for alpha6 
integrin, the beta4 integrin dominant-negative mutant or the puromycin resistance marker. Cells 

25 (10 5 ) were maintained in soft agar and evaluated as described in A2. Figure 5B shows the effect 
of the alpha6 anti-sense RNA expression on the cell surface expression of alpha6 integrin was 
monitored by FACS analysis using an anti-alpha6 rat monoclonal antibody and a F1TC conjugated 
goat anti-rat antiserum as the secondary antibody (upper panel). Expression of the beta4 
dominant-negative mutant was confirmed by RNAse protection. A probe overlapping the c- 

30 terminal end of the beta4 dominant negative mRNA was used to measure beta4 integrin and beta4 
dn expression in the same sample (lower panel). Figure 5C shows arf null mouse embryo 
fibroblasts (MEFs) that were infected with retroviruses expressing the oncogenes Myc and Ras. 
Cells were then additionally infected with a retroviruses expressing anti-sense RNA for alpha6 
integrin or the beta4 integrin dominant-negative mutant as shown in (A). MEFS 

35 Ras/Myc/Arfnull+aIpha6 sense express the alpha6 integrin chain. Figure 4D shows SW480 human 
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colon carcinoma cells that were infected as described in (A) using VSV pseudo-typed viruses. 
Infected cells were selected with puromycin in soft agar. Clones were counted 2 weeks after 
selection. SW480 beta4 dn/Gal4VPER cells, express a 40H-tamoxifen-inducible dominant- 
negative form of the beta4 integrin. The cells (10 5 ) were maintained in soft agar for 2 weeks in 
5 presence or absence of 40H-tamoxifen. 

IV. DETAILED DESCRIPTION 

The present invention may be understood more readily by reference to the following 
detailed description of preferred embodiments of the invention and the Examples included therein 
and to the Figures and their previous and following description. 

1 0 Before the present compounds, compositions, articles, devices, and/or methods are 

disclosed and described, it is to be understood that this invention is not limited to specific synthetic 
methods, specific recombinant biotechnology methods unless otherwise specified, or to particular 
reagents unless otherwise specified, as such may, of course, vary. It is also to be understood that 
the terminology used herein is for the purpose of describing particular embodiments only and is 

15 not intended to be limiting. 

A. Definitions 

As used in the specification and the appended claims, the singular forms "a," "an" and 
"the" include plural referents unless the context clearly dictates otherwise. Thus, for example, 
reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers, and the 
20 like. 

Ranges may be expressed herein as from "about" one particular value, and/or to "about" 
another particular value. When such a range is expressed, another embodiment includes from the 
one particular value and/or to the other particular value. Similarly, when values are expressed as 
approximations, by use of the antecedent "about," it will be understood that the particular value 
25 forms another embodiment. It will be further understood that the endpoints of each of the ranges 
are significant both in relation to the other endpoint, and independently of the other endpoint. 

In this specification and in the claims which follow, reference will be made to a number of 
terms which shall be defined to have the following meanings: 

"Optional" or "optionally" means that the subsequently described event or circumstance 
30 may or may not occur, and that the description includes instances where said event or circumstance 
occurs and instances where it does not. 

"Primers" are a subset of probes which are capable of supporting some type of enzymatic 
manipulation and which can hybridize with a target nucleic acid such that the enzymatic 
manipulation can occur. A primer can be made from any combination of nucleotides or nucleotide 
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derivatives or analogs available in the art which do not interfere with the enzymatic manipulation. 

"Probes" are molecules capable of interacting with a target nucleic acid, typically in a 
sequence specific manner, for example through hybridization. The hybridization of nucleic acids 
is well understood in the art and discussed herein. Typically a probe can be made from any 
5 combination of nucleotides or nucleotide derivatives or analogs available in the art. 

B. Compositions and methods 

Disclosed are compositions and methods related to integrins and integrin signaling. It is 
shown herein that integrin alpha6, integrin beta4, and lamininS, through at least up regulation of 
the beta and gamma chains of lamininS, are upregulated in cancer cells. Integrin alpha6 and 

10 integrin beta4 interact to form the integrin receptor, A6B4. The integrin receptor A6B4 

specifically interacts with lamininS and lamininS specifically interacts with A6B4. Furthermore, it 
is shown that interference with the production or function of alpha6, beta4, or the laminin gamma2 
chain not only prevents proliferation of the cancer cells, dependent on the upregulation of alpha6, 
beta4, and the laminin gamma2 chain, but that this kills the cancer cells as well. Alpha6 and beta4 

15 signaling occur through the integrin receptor A6B4. Thus, interference with the formation of 
A6B4, will interfere with the function of A6B4, for example, the signaling of A6B4. Thus, 
disclosed are compositions and methods that interfere with the function of alpha6, beta4, lamininS, 
the laminin gamma2 chain, or A6B4. Also disclosed are compositions and methods that interfere 
with the function of molecules involved with the signal transduction that is connected to either 

20 alpha6, beta4, lamininS, the laminin gamma2 chain, or A6B4. Also disclosed are methods for 
reducing the proliferation of cancer cells, as well as methods of killing cancer cells that involve 
using the compositions disclosed herein that interfere, reduce, or eliminate the function or the 
alpha6, beta4, laminin5, the laminin gamma2 chain, or A6B4 function. 

Disclosed herein is a relationship between two types of molecules in a cancer cell. The 
25 first type of molecule is an integrin receptor, composed of integrins, and the second type of 
molecule is a ligand that interacts with the integrin receptor, through the integrins. There are 
specificities that exist between the integrin receptors and their ligands. One aspect, disclosed 
herein is that when a cell goes from a non-cancerous state to a cancerous state, there is a co- 
uprcgulation of both the ligand (or parts of the ligand, such as subunits) and the cognate integrin 
30 receptor. The co-upregulation of both types of molecules creates an autocrine loop situation, 
wherein the signaling pathways controlled by the integrin receptor become autonomously 
activated, rather than exogenousiy activated, as would normally occur. The upregulation of both 
types of molecules creates a more fully transformed cellular phenotype in which cancer cell 
survival depends on the autocrine loop. Now therapeutic activities can target both points in the 
35 autocrine loop. Specific examples, of this co-upregulation in cancer cells are disclosed herein. For 
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example, laminin 5 (both the beta and gamma2 chains are upregulated) and integrin receptor 
alpha6beta4; laminin 10/1 1 and the integrin receptors alpha6betal and alpha3betal. 

As the Examples herein indicate which integrins, which integrin receptor, and which 
integrin ligand are involved in conferring cancer cells ability to grow in the absence of extra 
5 cellular matrix (ECM), also disclosed are methods using these integrins, integrin receptors and 
ligands to identify molecules that interact with them and/or interfere with their function. 

1. Compositions 

Disclosed are the components to be used to prepare the disclosed compositions as well as 
the compositions themselves to be used within the methods disclosed herein. These and other 

10 materials are disclosed herein, and it is understood that when combinations, subsets, interactions, 
groups, etc. of these materials are disclosed, while specific reference of each various individual 
and collective permutation of these compounds may not be explicitly disclosed, each is 
specifically contemplated and described herein. For example, if a particular beta4 or alpha6 is 
disclosed and discussed and a number of modifications that can be made to a number of molecules 

15 including the modifications to beta4 or alpha6 are discussed, specifically contemplated is each and 
every combination and permutation of these modifications and the modifications that are possible 
unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are 
disclosed as well as a class of molecules D, E, and F and an example of a combination molecule, 
A-D is disclosed, then even if each is not individually recited, each is individually and collectively 

20 contemplated. Thus, combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are considered 
disclosed. Likewise, any subset or combination of these is also disclosed. Thus, for example, the 
sub-group of A-E, B-F, and C-E would be considered disclosed. This concept applies to all 
aspects of this application including, but not limited to, steps in methods of making and using the 
disclosed compositions. Thus, if there are a variety of additional steps that can be performed it is 

25 understood that each of these additional steps can be performed with any specific embodiment or 
combination of embodiments of the disclosed methods. 

a) Integrins and their liaands 
For proper embryonic development, tissue homeostasis, and wound healing, cell 
proliferation must be tightly regulated, both in space and over time. In particular, a cell must be 
30 able to sense its relationship to other cells and the extra-cellular matrix (ECM) and convert these 
positional cues into biochemical signals affecting the regulation of proliferation. Because of their 
ability to couple the recognition of positional cues to the activation of intracellular signaling 
pathways, adhesion receptors, such as integrins and cadherins, are likely to be necessary to achieve 
this goal. 

35 The integrins mediate cell adhesion primarily by binding to distinct, although overlapping, 
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subsets of ECM proteins. Normal cells require contact with serum-derived ECM components for 
proliferation, differentiation and survival (e.g. Clark and Brugge, 1995; Lin and Bissell, 1993; 
Parise et aL, 2000), a phenomenon called anchorage-dependence. This involves signaling through 
integrin receptors (Hynes, 1992). Fibronectin and laminin as well as other ECM proteins are 

5 known to act as ligands for integrin receptors (Akiyama et aL, 1990). Integrins are transmembrane 
proteins forming alpha-beta chain heterodimers. Alpha and beta chain integrins are members of 
distinct gene families. The ligand binding specificity of the hetero dimers is determined by 
specific combinations of alpha and beta chain gene family members (Hynes, 1992). Ligand 
binding triggers signaling of integrin receptors through the cytoplasmic tail of the beta chain via 

10 interaction with various signaling components. 

Integrins activate common as well as subgroup-specific signaling pathways (Clark and 
Brugge, 1995; Giancotti and Ruoslahti, 1999). In particular, while most integrins activate focal 
adhesion kinase (FAK), the al(3t, aSpi, av03 and a6(34 integrins are coupled to the Ras- 
extracellular signal-regulated kinase (ERK) signaling pathway by She (Mainiero et aL, 1997; 

15 Mainiero et aL, 1995; Wary et aL, 1996). She is an SH2-PTB domain adapter protein expressed in 
three forms, p46, p52 and p66, two of which (p46 and p52) link various tyrosine kinases to Ras by 
recruiting the Grb2/SOS complex to the plasma membrane (Pawson and Scott, 1997). Upon 
activation by SOS, Ras stimulates a kinase cascade culminating in the activation of the mitogen- 
activated protein kinase (MAPK) ERK (Marshall, 1995). ERK phosphoryiates ternary complex 

20 transcription factors, such as Elk-1 and Sap-1/2, and promotes transcription of the immediate-early 
gene Fos (Treisman, 1996). In primary endothelial cells and keratinocytes, mitogens and She- 
linked integrins cooperate, in a synergic fashion, to promote transcription from the Fos promoter. 
Accordingly, ligation of integrins linked to She enables these cells to progress through Gl in 
response to mitogens, whereas ligation of other integrins results in growth arrest, even in the 

25 presence of mitogens (Mainiero et aL, 1997; Wary et aL, 1996). She is like a binary switch 

controlling cell cycle progression in response to the ECM. Moreover, integrin receptors have been 
shown to induce intracellular signaling leading to AKT activation supporting cell survival (Lee 
and Juliano, 2000). 

In contrast with normal cells, cancer cells generally are able to survive and proliferate in 
30 the absence of anchorage to ECM (Giancotti and Mainiero, 1994), suggesting that tumor cell 
survival and proliferation have become independent of the engagement of integrin signaling 
through ECM. 

Proliferation in the absence of anchorage to ECM of secondary rat embryo fibroblasts 
requires the cooperation of Ras and Myc or Ras and adenovirus Ela oncogenes (Land et aL, 1983; 
35 Ruley, 1983). Similarly, murine colonic epithelial cells require both activated Ras and mutation of 
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the adenomous polyposis coli gene (APCmin) (D'Abaco et al., 1996) in order to proliferate in 
suspension. 

Integrins are a large family of cell surface receptor molecules that function to mediate 
interactions between cells and between cells and the extracellular matrix. Integrin receptors are 
5 heterodimers composed of two subunits, an alpha integrin and a beta integrin. The heterodirner 
forms, is expressed on the cell surface, and acts to transmit signals obtained from interactions with 
the extracellular matrix or other cells, through the cellular membrane and into the cytosol of the 
cell. The signal transduction that takes place occurs because of ligand interactions with the 
receptor. Integrin receptors can have a number of ligands, including collagens, fibronectins, and 
10 laminins. 

There are currently at least 1 8 different alpha integrins and at least 8 different beta 
integrins that have been shown to form at least 24 different alphabeta heterodimers. Certain 
integrins, such as betal, interact in a number of different heterodimers, but many subunits only 
form a single heterodirner, either because of structural constraints on their interactions, or cellular 

1 5 expression patterns that provide only a limited number of potential dimer partners. Disclosed 
herein are specific relationships that occur between a subset of integrins, integrin receptors, and 
their ligands. The disclosed relationships, revolve around the alpha6beta4 receptor, formed by the 
alpha6 and beta4 integrins. Of particular interest is the relationship between the ligand for the 
alpha6beta4 receptor, lamininS. The laminins are made up of 3 chains, an alpha chain, a beta 

20 chain, and a gamma chain. The specificity of the interaction between lamininS and alpha6beta4 
receptor is controlled by the gamma chain. LamininS contains a gamma2 chain which only 
interacts with the alpha6beta4 integrin receptor. 

Integrin alpha6 has seven amino-terminal repeating segments that may fold into a seven 
unit beta-propeller, five n-terminal FG-GAP domains and three divalent cation sites. The 

25 transmembrane domain is followed by a short cytoplasmic tail, that is alternatively spliced in A 
and B forms. The alpha6 integrin chain also shows alternative splicing between repeat units III 
and IV, resulting in the presence or absence of Exon X2. Integrin alpha6 is processed into a heavy 
and a light chain that are disulphide linked. A representative allele of the human alpha6 cDNA is 
set forth in SEQ ID NO:L It is understood that the disclosed functional domains as well as the 

30 others contained within alpha6 are considered separately disclosed as discreet fragments of the 
alpha6 protein as well as the nucleic acid that encodes them. 

Integrin beta4 contains a MIDAS-like motif and four cystcine-rich repeats, three EGF-like 
domains in the N-terminal extracellular domain, a trans-membrane region and a long cytoplasmic 
tail containing two pairs of fibronectin Type III repeats. The latter are connected by a variable 
35 segment that may undergo alternative splicing. Integrin beta4 also undergoes proteolytic 
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processing in its cytoplasmic tail, causing the 200kD mature form to be converted to 165 and 
130kD fragments. A representative allele of the human beta4 cDNA is set forth in SEQ ID NO:5. 
It is understood that the disclosed functional domains as well as the others contained within beta4 
are considered separately disclosed as discreet fragments of the beta4 protein as well as the nucleic 
5 acid that encodes them. 

Laminin5 is composed of the laminin chains alpha3, beta3 and gamma2. LamininS can 
may contain either the shorter laminin alpha3 A chain or the longer alpha3B chain. Laminin5 can 
also be trimmed by proteolytic processing of the N-terminal portion of its alpha3 A chain and the 
N-terminal portion of the gamma2 chain. The Laminin gamma2 chain contains at least six laminin 

10 EGF-like domains (Domains III and V) with an embedded laminin B domain (Domain IV) within 
the N-terminal half. The c-terminal tail contains a coiled-coil domain. The N-terminal processed 
portion of the gamma2 chain is sufficient to bind to and activate the integrin alpha6/beta4 receptor. 
A representative allele of the human Iaminin5-gamma2 cDNA is set forth in SEQ ID NO: 13. It is 
understood that the disclosed functional domains as well as the others contained within the 

15 laminin5 protein and Iaminin5-gamma2 protein are considered separately disclosed as discreet 
fragments of the laminin5 protein and Iaminin5-gamma2 protein as well as and the nucleic acid 
that encodes them. 

Disclosed are compositions and methods for inhibiting integrin signaling, for example, 
integrin signaling dependent on alpha6 and beta4 integrins. For example, compositions and 

20 methods that inhibit integrin receptor signaling from, for example, the alpha6beta4 integrin 

receptor are disclosed. It is also understood that the integrin receptor signaling can be affected by, 
for example, interfering with a molecule, such as a ligand for the integrin receptor, or a 
downstream signaling molecule of the integrin receptor in a way that prevents the integrin receptor 
signal from being fully propagated. It is understood that the compositions and methods for 

25 inhibition of integrin signaling and function can be any composition or method that ultimately 
inhibits the cell proliferation in which the integrin is expressed, by for example killing the cell. It 
is understood that the compositions and methods typically can fall into three basic non-limiting 
classes of function regulators, which are discussed herein. 

(1) Classes 

30 (a) Production regulators 

Production regulators is a broad class of integrin function regulators that are directed at the 
production of the target integrin, by for example, preventing mRNA synthesis or expression of the 
target integrin, or by causing mRNA degradation of the target integrin which inhibits the 
translation of the target integrin. While production regulators, can be any type of molecule 

35 targeting any point in the integrin production pathway, typically these types of compositions will 
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target either the mRNA expression or the protein expression of the integrin. For example, if beta4 
integrin, alpha6 integrin, or the gamma2 subunit of laminin5, which has been shown herein to be 
upregulated in cancer cells and which causes the cancer cell to be able to live in the absence of the 
ECM, was the target integrin, a typical production regulator of beta4 integrin, alpha6 integrin, or 
5 the gamma2 subunit of lamininS would be for example, an antisense molecule that targeted the 
mRNA of beta4 integrin, alpha6 integrin, or the gamma2 subunit of lamininS. It is also understood 
that a production regulator could also target any molecule that is disclosed herein, or is within the 
signaling chain associated with a target integrin or integrin receptor. For example, the inhibition 
of the production of the ligand for a target integrin receptor is one way of inhibiting integrin 
10 function of that receptor. Thus, production regulators can either inhibit or enhance integrin 
production. 

(b) Integrin to integrin regulators 

Another type of integrin function regulator is an integrin-integrin regulator. This type of 
function regulator, typically prevents integrins from interacting to form a functional integrin 

1 5 receptor. For example, an integrin to integrin regulator could be a composition that would interact 
with beta4 in a way that would prevent beta4 from interacting with alpha6 to form the alpha6beta4 
integrin receptor or it could be a composition that would interact with alpha6 in a way that would 
prevent alpha6 from interacting with beta4 to form the alpha6beta4 integrin receptor. It is also 
contemplated that the function regulators of the integrin to integrin interaction can affect the 

20 signaling pathways dependent on integrin receptors containing alph6 or beta4 integrins. It is not 
required that an integrin to integrin regulator prevent an integrin from interacting with all of the 
possible integrin partners it .could interact with, just that it prevent the interaction of the target 
integrin with another specific integrin. For example, the integrin alpha6 interacts with betal 
integrin and beta4 integrin. In certain embodiments, the compositions interfere with alpha6-beta4 

25 interactions but do not interfere with alpha6-betal interactions. 

(c) Integrin to other molecule regulators 

The third class of function regulators are the integrin to other molecule regulators. These 
compositions are designed to specifically interfere with molecules such as small molecule ligands 
or other proteins that interact with the integrin or integrin receptor. For example, an integrin to 

30 other molecule regulator might target a ligand for a particular integrin receptor, such as the 

alpha6beta4 receptor. The ligand for the alpha6beta4 receptor is laminin5 or the laminin gamma2 
chain. Compositions that interact with lamininS such that lamininS or the laminin gamma2 chain 
interactions with alpha6beta4 are inhibited or reduced are specifically contemplated herein. 
Likewise, there are other molecules, such as She molecules, that also interact with integrins, such 

35 as the alpha6beta4 receptor. Compositions that specifically interact with the She molecules such 
that they prevent the appropriate interactions between the She molecule and the alpha6beta4 
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receptor are disclosed. 

(2) Types 

Just as there are different general classes of molecules that can regulate (such as by 
inhibition) the function of the disclosed integrins and integrin receptors, so to there are many 

5 different types of molecules that perform that regulation. For example, any molecule that can 
perform the regulation of for example, the disclosed integrins, integrin receptors, or signaling 
pathways produced by the disclosed integrins and integrin receptors are contemplated. For 
example, antibodies or small molecules which inhibit the disclosed compositions are herein 
disclosed. Also disclosed are, for example, functional nucleic acids, such as ribozymes or 

10 antisense molecules that can inhibit the disclosed integrin function in a variety of ways. A non- 
limiting list of exemplary molecules is discussed herein. 

(a) Antibodies 

Antibodies can be used to regulate, for example, the function of the disclosed integrins and 
integrin receptors, molecules that interact with the disclosed integrin receptors, and molecules in 
15 the signaling pathways of the disclosed integrin receptors. 

As used herein, the term "antibody" encompasses, but is not limited to, whole 
immunoglobulin (i.e., an intact antibody) of any class. Native antibodies are usually 
heterotetrameric glycoproteins, composed of two identical light (L) chains and two identical heavy 
(H) chains. Typically, each light chain is linked to a heavy chain by one covalent disulfide bond, 

20 while the number of disulfide linkages varies between the heavy chains of different 

immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intrachain 
disulfide bridges. Each heavy chain has at one end a variable domain (V(H)) followed by a 
number of constant domains. Each light chain has a variable domain at one end (V(L)) and a 
constant domain at its other end; the constant domain of the light chain is aligned with the first 

25 constant domain of the heavy chain, and the light chain variable domain is aligned with the 
variable domain of the heavy chain. Particular amino acid residues are believed to form an 
interface between the light and heavy chain variable domains. The light chains of antibodies from 
any vertebrate species can be assigned to one of two clearly distinct types, called kappa (k) and 
lambda (1), based on the amino acid sequences of their constant domains. Depending on the amino 

30 acid sequence of the constant domain of their heavy chains, immunoglobulins can be assigned to 
different classes. There are five major classes of human immunoglobulins: IgA, IgD, IgE, IgG and 
IgM, and several of these may be further divided into subclasses (isotypes), e.g., IgG-1, IgG-2, 
IgG-3, and IgG-4; IgA- 1 and IgA-2. One skilled in the art would recognize the comparable classes 
for mouse. The heavy chain constant domains that correspond to the different classes of 

35 immunoglobulins are called alpha, delta, cpsilon, gamma, and mu, respectively. 
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Antibodies can be either polyclonal or monoclonal. Polyclonal antibodies, typically are 
derived from the serum of an animal that has been immunogenically challenged, and monoclonal 
antibodies are derived'as discussed herein. 

The term "variable" is used herein to describe certain portions of the variable domains that 
5 differ in sequence among antibodies and are used in the binding and specificity of each particular 
antibody for its particular antigen. However, the variability is not usually evenly distributed 
through the variable domains of antibodies. It is typically concentrated in three segments called 
complementarity determining regions (CDRs) or hypervariable regions both in the light chain and 
the heavy chain variable domains. The more highly conserved portions of the variable domains 

10 are called the framework (FR). The variable domains of native heavy and light chains each 
comprise four FR regions, largely adopting a b-sheet configuration, connected by three CDRs, 
which form loops connecting, and in some cases forming part of, the b-sheet structure. The CDRs 
in each chain are held together in close proximity by the FR regions and, with the CDRs from the 
other chain, contribute to the formation of the antigen binding site of antibodies (see Kabat E. A. 

1 5 et aL, "Sequences of Proteins of Immunological Interest,'* National Institutes of Health, Bethesda, 
Md. (1987)). The constant domains are not involved directly in binding an antibody to an antigen, 
but exhibit various effector functions, such as participation of the antibody in antibody-dependent 
cellular toxicity. 

As used herein, the term "antibody or fragments thereof encompasses chimeric antibodies 
20 and hybrid antibodies, with dual or multiple antigen or epitope specificities, and fragments, such as 
F(ab')2, Fab 1 , Fab and the like, including hybrid fragments. Thus, fragments of the antibodies that 
retain the ability to bind their specific antigens are provided. For example, fragments of antibodies 
which maintain EphA2 binding activity arc included within the meaning of the term "antibody or 
fragment thereof." Such antibodies and fragments can be made by techniques known in the art and 
25 can be screened for specificity and activity according to the methods set forth in the Examples and 
in general methods for producing antibodies and screening antibodies for specificity and activity 
(See Harlow and Lane. Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New 
York, (1988)). 

Also included within the meaning of "antibody or fragments thereof are conjugates of 
30 antibody fragments and antigen binding proteins (single chain antibodies) as described, for 

example, in U.S. Pat No. 4,704,692, the contents of which are hereby incorporated by reference. 
Single chain divalent antibodies are also provided. 

Optionally, the antibodies are generated in other species and "humanized" for 
administration in humans. Humanized forms of non-human (e.g., murine) antibodies are chimeric 
35 immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab , )2, or 
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other antigen-binding subsequences of antibodies) which contain minimal sequence derived from 
non-human immunoglobulin. Humanized antibodies include human immunoglobulins (recipient 
antibody) in which residues from a complementary determining region (CDR) of the recipient are 
replaced by residues from a CDR of a non-human species (donor antibody) such as mouse, rat or 
5 rabbit having the desired specificity, affinity and capacity. In some instances, Fv framework 
residues of the human immunoglobulin are replaced by corresponding non-human residues. 
Humanized antibodies may also comprise residues that are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody will 
comprise substantially all of at least one, and typically two, variable domains, in which all or 

10 substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all 
or substantially all of the FR regions are those of a human immunoglobulin consensus sequence. 
The humanized antibody optimally also will comprise at least a portion of an immunoglobulin 
constant region (Fc), typically that of a human immunoglobulin (Jones et al., Nature, 321:522-525 
(1986); Riechmannet ah, Nature, 332:323-327 (1988); and Presta, Curr. Op. Struct. Biol., 2:593- 

15 596(1992)). 

Methods for humanizing non-human antibodies are well known in the art. Generally, a 
humanized antibody has one or more amino acid residues introduced into it from a source that is 
non-human. These non-human amino acid residues are often referred to as "import" residues, 
which are typically taken from an "import" variable domain. Humanization can be essentially 

20 performed following the method of Winter and co-workers (Jones et al., Nature, 321 :522-525 
(1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 
(1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences of a 
human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Pat. 
No. 4,816,567), wherein substantially less than an intact human variable domain has been 

25 substituted by the corresponding sequence from a non-human species. In practice, humanized 
antibodies are typically human antibodies in which some CDR residues and possibly some FR 
residues are substituted by residues from analogous sites in rodent antibodies. 

The choice of human variable domains, both light and heavy, to be used in making the 
humanized antibodies is very important in order to reduce antigenicity. According to the u best-fif 

30 method, the sequence of the variable domain of a rodent antibody is screened against the entire 
library of known human variable domain sequences. The human sequence which is closest to that 
of the rodent is then accepted as the human framework (FR) for the humanized antibody (Sims et 
al., J. Immunol., 151:2296 (1993) and Chothia et al., J. Mol. Biol., 196:901 (1987)). Another 
method uses a particular framework derived from the consensus sequence of all human antibodies 

35 of a particular subgroup of light or heavy chains. The same framework may be used for several 
different humanized antibodies (Carter etal M Proc. Natl. Acad. Sci. USA, 89:4285 (1992); 
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Presta et a]., J. Immunol., 151:2623 (1993)). 

It is further important that antibodies be humanized with retention of high affinity for the 
antigen and other favorable biological properties. To achieve this goal, according to a preferred 
method, humanized antibodies are prepared by a process of analysis of the parental sequences and 

5 various conceptual humanized products using three dimensional models of the parental and 
humanized sequences. Three dimensional immunoglobulin models are commonly available and 
are familiar to those skilled in the art. Computer programs are available which illustrate and 
display probable three-dimensional conformational structures of selected candidate 
immunoglobulin sequences. Inspection of these displays permits analysis of the likely role of the 

10 residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis of residues 
that influence the ability of the candidate immunoglobulin to bind its antigen. In this way, FR 
residues can be selected and combined from the consensus and import sequence so that the desired 
antibody characteristic, such as increased affinity for the target antigen(s), is achieved. In general, 
the CDR residues are directly and most substantially involved in influencing antigen binding (see, 

15 WO 94/04679, published 3 March 1994). 

Transgenic animals (e.g., mice) that are capable, upon immunization, of producing a full 
repertoire of human antibodies in the absence of endogenous immunoglobulin production can be 
employed. For example, it has been described that the homozygous deletion of the antibody heavy 
chain joining region (J(H)) gene in chimeric and germ-line mutant mice results in complete 

20 inhibition of endogenous antibody production. Transfer of the human germ-line immunoglobulin 
gene array in such germ-line mutant mice will result in the production of human antibodies upon 
antigen challenge (see, e.g., Jakobovits et al., Proc. Natl Acad. Sci. USA, 90:2551-255 (1993); 
Jakobovits et al., Nature, 362:255-258 (1993); Bruggemann et al., Year in Immuno., 7:33 (1993)). 
Human antibodies can also be produced in phage display libraries (Hoogenboom et al., J. Mol. 

25 Biol., 227:381 (1991); Marks etal, J. Mol. BioL, 222:581 (1991)). The techniques of Cote etal. 
and Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et 
ah, Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985); Boemer et ah, J. 
Immunol., 147(l):86-95 (1991)). 

The present invention further provides a hybridoma cell that produces the monoclonal 
30 antibody of the invention. The term "monoclonal antibody" as used herein refers to an antibody 
obtained from a substantially homogeneous population of antibodies, i.e., the individual antibodies 
comprising the population are identical except for possible naturally occurring mutations that may 
be present in minor amounts. The monoclonal antibodies herein specifically include "chimeric" 
antibodies in which a portion of the heavy and/or light chain is identical with or homologous to 
35 corresponding sequences in antibodies derived from a particular species or belonging to a 
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particular antibody class or subclass, while the remainder of the chain(s) is identical with or 
homologous to corresponding sequences in antibodies derived from another species or belonging 
to another antibody class or subclass, as well as fragments of such antibodies, so long as they 
exhibit the desired activity (See, U.S. Pat. No. 4,816,567 and Morrison et al., Proc. Natl. Acad. 
5 Sci. USA, 81:6851-6855 (1984)). 

Monoclonal antibodies of the invention may be prepared using hybridoma methods, such 
as those described by Kohler and Milstein, Nature, 256:495 (1975) or Harlow and Lane. 
Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, (1988). In a 
hybridoma method, a mouse or other appropriate host animal, is typically immunized with an 

10 immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that 
will specifically bind to the immunizing agent. Alternatively, the lymphocytes may be immunized 
in vitro. Preferably, the immunizing agent comprises EphA2. Traditionally, the generation of 
monoclonal antibodies has depended on the availability of purified protein or peptides for use as 
the immunogen. More recently DNA based immunizations have shown promise as a way to elicit 

15 strong immune responses and generate monoclonal antibodies. In this approach, DNA-based 
immunization can be used, wherein DNA encoding a portion of EphA2 expressed as a fusion 
protein with human IgGl is injected into the host animal according to methods known in the art 
(e.g., Kilpatrick KE, et al. Gene gun delivered DNA-based immunizations mediate rapid 
production of murine monoclonal antibodies to the Flt-3 receptor. Hybridoma. 1998 

20 Dec; 17(6):569-76; Kilpatrick KE et al. High-affinity monoclonal antibodies to PED/PEA- 1 5 

generated using 5 microg of DNA. Hybridoma. 2000 Aug;19(4):297-302, which are incorporated 
herein by referenced in full for the methods of antibody production). 

An alternate approach to immunizations with either purified protein or DNA is to use 
antigen expressed in baculovirus. The advantages to this system include ease of generation, high 

25 levels of expression, and post-translational modifications that are highly similar to those seen in 
mammalian systems. Use of this system involves expressing domains of EphA2 antibody as 
fusion proteins. The antigen is produced by inserting a gene fragment in-frame between the signal 
sequence and the mature protein domain of the EphA2 antibody nucleotide sequence. This results 
in the display of the foreign proteins on the surface of the virion. This method allows 

30 immunization with whole virus, eliminating the need for purification of target antigens. 

Generally, peripheral blood lymphocytes ("PBLs") are used in methods of producing 
monoclonal antibodies if cells of human origin are desired, or spleen cells or lymph node cells are 
used if non-human mammalian sources are desired. The lymphocytes are then fused with an 
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
35 hybridoma cell (Goding, "Monoclonal Antibodies: Principles and Practice" Academic Press, 
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(1986) pp. 59-103). Immortalized cell lines are usually transformed mammalian cells, including 
myeloma cells of rodent, bovine, equine, and human origin. Usually, rat or mouse myeloma cell 
lines are employed. The hybridoma cells may be cultured in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the unfused, 
5 immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically 
will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances 
prevent the growth of HGPRT-deficient cells. Preferred immortalized cell lines are those that fuse 
efficiently, support stable high level expression of antibody by the selected antibody-producing 

10 cells, and are sensitive to a medium such as HAT medium. More preferred immortalized cell lines 
are murine myeloma lines, which can be obtained, for instance, from the Salic Institute Cell 
Distribution Center, San Diego, Calif, and the American Type Culture Collection, Rockville, Md. 
Human myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur et 

15 al, "Monoclonal Antibody Production Techniques and Applications" Marcel Dekker, Inc., New 
York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for the 
presence of monoclonal antibodies directed against EphA2. Preferably, the binding specificity of 
monoclonal antibodies produced by the hybridoma cells is determined by immunoprecipitation or 
20 by an in vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent 
assay (ELISA). Such techniques and assays are known in the art, and are described further in the 
Examples below or in Harlow and Lane "Antibodies, A Laboratory Manual" Cold Spring Harbor 
Publications, New York, (1988). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting 
25 dilution or FACS sorting procedures and grown by standard methods. Suitable culture media for 
this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the 
culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, 
30 for example, protein A-Sepharose, protein G, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as 
those described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
35 oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
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light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 
plasmacytoma cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to 

5 obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also may 
be modified, for example, by substituting the coding sequence for human heavy and light chain 
constant domains in place of the homologous murine sequences (U.S. Pat. No. 4,816,567) or by 
covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a 
non-immunoglobulin polypeptide. Optionally, such a non-immunoglobulin polypeptide is 

10 substituted for the constant domains of an antibody of the invention or substituted for the variable 
domains of one antigen-combining site of an antibody of the invention to create a chimeric 
bivalent antibody comprising one antigen-combining site having specificity for EphA2 and another 
antigen-combining site having specificity for a different antigen. 

In vilro methods are also suitable for preparing monovalent antibodies. Digestion of 
15 antibodies to produce fragments thereof, particularly, Fab fragments, can be accomplished using 

routine techniques known in the art. For instance, digestion can be performed using papain. 

Examples of papain digestion are described in WO 94/29348 published Dec. 22, 1994, U.S. Pat. 

No. 4,342,566, and Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor 

Publications, New York, (1988), Papain digestion of antibodies typically produces two identical 
20 antigen binding fragments, called Fab fragments, each with a single antigen binding site, and a 

residual Fc fragment. Pepsin treatment yields a fragment, called the F(ab')2 fragment, that has 

two antigen combining sites and is still capable of cross-linking antigen. 

The Fab fragments produced in the antibody digestion also contain the constant domains 
of the light chain and the first constant domain of the heavy chain. Fab' fragments differ from Fab 

25 fragments by the addition of a few residues at the carboxy terminus of the heavy chain domain 
including one or more cysteines from the antibody hinge region. The F(ab')2 fragment is a 
bivalent fragment comprising two Fab* fragments linked by a disulfide bridge at the hinge region. 
Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains 
bear a free thiol group. Antibody fragments originally were produced as pairs of Fab 1 fragments 

30 which have hinge cysteines between them. Other chemical couplings of antibody fragments are 
also known. 

An isolated immunogenically specific paratope or fragment of the antibody is also 
provided. A specific immunogenic epitope of the antibody can be isolated from the whole 
antibody by chemical or mechanical disruption of the molecule. The purified fragments thus 
35 obtained are tested to determine their immunogenicity and specificity by the methods taught 
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herein. Immunoreactive paratopes of the antibody, optionally, are synthesized directly. An 
immunoreactive fragment is defined as an amino acid sequence of at least about two to five 
consecutive amino acids derived from the antibody amino acid sequence. 

One method of producing proteins comprising the antibodies of the present invention is to 
5 link two or more peptides or polypeptides together by protein chemistry techniques. For example, 
peptides or polypeptides can be chemically synthesized using currently available laboratory 
equipment using either Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert -butyloxycarbonoyl) 
chemistry. (Applied Biosystems, Inc., Foster City, CA). One skilled in the art can readily 
appreciate that a peptide or polypeptide corresponding to the antibody of the present invention, for 

10 example, can be synthesized by standard chemical reactions. For example, a peptide or 

polypeptide can be synthesized and not cleaved from its synthesis resin whereas the other fragment 
of an antibody can be synthesized and subsequently cleaved from the resin, thereby exposing a 
terminal group which is functionally blocked on the other fragment. By peptide condensation 
reactions, these two fragments can be covalently joined via a peptide bond at their carboxyl and 

1 5 amino termini, respectively, to form an antibody, or fragment thereof. (Grant GA (1992) Synthetic 
Peptides: A User Guide. W.H. Freeman and Co., N.Y. (1992); Bodansky Maud TrostB., Ed. 

(1993) Principles of Peptide Synthesis. Springer- Verlag Inc., NY. Alternatively, the peptide or 
polypeptide is independently synthesized in vivo as described above. Once isolated, these 
independent peptides or polypeptides may be linked to form an antibody or fragment thereof via 

20 similar peptide condensation reactions. 

For example, enzymatic ligation of cloned or synthetic peptide segments allow relatively 
short peptide fragments to be joined to produce larger peptide fragments, polypeptides or whole 
protein domains (Abrahmsen L et al, Biochemistry, 30:415 1 (1991)). Alternatively, native 
chemical ligation of synthetic peptides can be utilized to synthetically construct large peptides or 

25 polypeptides from shorter peptide fragments. This method consists of a two step chemical reaction 
(Dawson et al. Synthesis of Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)). 
The first step is the chemoselective reaction of an unprotected synthetic peptide-alpha-thioester 
with another unprotected peptide segment containing an amino- terminal Cys residue to give a 
thioester-linked intermediate as the initial covalent product. Without a change in the reaction 

30 conditions, this intermediate undergoes spontaneous, rapid intramolecular reaction to form a native 
peptide bond at the ligation site. Application of this native chemical ligation method to the total 
synthesis of a protein molecule is illustrated by the preparation of human interleukin 8 (IL-8) 
(Baggiolini M et al. (1992) FEBS Lett. 307:97-101; Clark-Lewis I et al., J.Biol.Chem., 269:16075 

(1994) ; Clark-Lewis I et al., Biochemistry, 30:3128 (1991); RajarathnamKetal., Biochemistry 
35 33:6623-30 (1994)). 
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Alternatively, unprotected peptide segments are chemically linked where the bond formed 
between the peptide segments as a result of the chemical ligation is an unnatural (non-peptide) 
bond (Schnolzer, M et al. Science, 256:221 (1992)). This technique has been used to synthesize 
analogs of protein domains as well as large amounts of relatively pure proteins with full biological 
5 activity (deLisle Milton RC et al., Techniques in Protein Chemistry IV. Academic Press, New 
York, pp. 257-267(1992)). 

The invention also provides fragments of antibodies which have bioactivity. The 
polypeptide fragments of the present invention can be recombinant proteins obtained by cloning 
nucleic acids encoding the polypeptide in an expression system capable of producing the 

10 polypeptide fragments thereof, such as an adenovirus or baculovirus expression system. For 
example, one can determine the active domain of an antibody from a specific hybridoma that can 
cause a biological effect associated with the interaction of the antibody with EphA2. For example, 
amino acids found to not contribute to either the activity or the binding specificity or affinity of the 
antibody can be deleted without a loss in the respective activity. For example, in various 

15 embodiments, amino or carboxy-terminal amino acids are sequentially removed from either the 
native or the modified non-immunoglobulin molecule or the immunoglobulin molecule and the 
respective activity assayed in one of many available assays. In another example, a fragment of an 
antibody comprises a modified antibody wherein at least one amino acid has been substituted for 
the naturally occurring amino acid at a specific position, and a portion of either amino terminal or 

20 carboxy terminal amino acids, or even an internal region of the antibody, has been replaced with a 
polypeptide fragment or other moiety, such as biotin, which can facilitate in the purification of the 
modified antibody. For example, a modified antibody can be fused to a maltose binding protein, 
through either peptide chemistry or cloning the respective nucleic acids encoding the two 
polypeptide fragments into an expression vector such that the expression of the coding region 

25 results in a hybrid polypeptide. The hybrid polypeptide can be affinity purified by passing it over 
an amylose affinity solid support, and the modified antibody receptor can then be separated from 
the maltose binding region by cleaving the hybrid polypeptide with the specific protease factor Xa. 
(See, for example, New England Biolabs Product Catalog, 1996, pg. 164.). Similar purification 
procedures are available for isolating hybrid proteins from eukaryotic cells as well. 

30 The fragments, whether attached to other sequences or not, include insertions, deletions, 

substitutions, or other selected modifications of particular regions or specific amino acids residues, 
provided the activity of the fragment is not significantly altered or impaired compared to the 
nonmodified antibody or antibody fragment. These modifications can provide for some additional 
property, such as to remove or add amino acids capable of disulfide bonding, to increase its bio- 

35 longevity, to alter its secretory characteristics, etc. In any case, the fragment must possess a 
bioactive property, such as binding activity, regulation of binding at the binding domain, etc. 
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Functional or active regions of the antibody may be identified by mutagenesis of a specific region 
of the protein, followed by expression and testing of the expressed polypeptide. Such methods are 
readily apparent to a skilled practitioner in the art and can include site-specific mutagenesis of the 
nucleic acid encoding the antigen. (Zoller MJ et al. Nucl. Acids Res. 10:6487-500(1982). 

5 A variety of immunoassay formats may be used to select antibodies that selectively bind 

with a particular protein, variant, or fragment. For example, solid-phase ELISA immunoassays are 
routinely used to select antibodies selectively immunoreactive with a protein, protein variant, or 
fragment thereof. See Harlow and Lane. Antibodies, A Laboratory Manual. Cold Spring Harbor 
Publications, New York, (1988), for a description of immunoassay formats and conditions that 
10 could be used to determine selective binding. The binding affinity of a monoclonal antibody can, 
for example, be determined by the Scatchard analysis of Munson et al., Anal. Biochem., 107:220 
(1980). 

Also provided is an antibody reagent kit comprising containers of the monoclonal antibody 
or fragment thereof of the invention and one or more reagents for detecting binding of the antibody 
15 or fragment thereof to the EphA2 receptor molecule. The reagents can include, for example, 
fluorescent tags, enzymatic tags, or other tags. The reagents can also include secondary or tertiary 
antibodies or reagents for enzymatic reactions, wherein the enzymatic reactions produce a product 
that can be visualized. 

(b) Functional Nucleic Acids 
20 Functional nucleic acids can also be used to regulate the , for example, the function of the 

disclosed integrins, integrin receptors, molecules that interact with the disclosed integrin receptors, 
and molecules in the signaling pathways of the disclosed integrin receptors. 

Functional nucleic acids are nucleic acid molecules that have a specific function, such as 
binding a target molecule or catalyzing a specific reaction. Functional nucleic acid molecules can 

25 be divided into the following categories, which are not meant to be limiting. For example, 
functional nucleic acids include antisense molecules, aptamers, ribozymes, triplex forming 
molecules, and external guide sequences. The functional nucleic acid molecules can act as 
affectors, inhibitors, modulators, and stimulators of a specific activity possessed by a target 
molecule, or the functional nucleic acid molecules can possess a dc novo activity independent of 

30 any other molecules. 

Functional nucleic acid molecules can interact with any macromolecule, such as DNA, 
RNA, polypeptides, or carbohydrate chains. Thus, functional nucleic acids can interact with the 
mRNA of beta4 integrin for example, or the genomic DNA of alpha6 integrin for example, or they 
can interact with the polypeptide laminini, or the gamma2 subunit of laminin5. Often functional 
35 nucleic acids are designed to interact with other nucleic acids based on sequence homology 
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between the target molecule and the functional nucleic acid molecule. In other situations, the 
specific recognition between the functional nucleic acid molecule and the target molecule is not 
based on sequence homology between the functional nucleic acid molecule and the target 
molecule, but rather is based on the formation of tertiary structure that allows specific recognition 

5 to take place. Antisense molecules are designed to interact with a target nucleic acid molecule 
through either canonical or non-canonical base pairing. The interaction of the antisense molecule 
and the target molecule is designed to promote the destruction of the target molecule through, for 
example, RNAseH mediated RNA-DNA hybrid degradation. Alternatively the antisense molecule 
is designed to interrupt a processing function that normally would take place on the target 

10 molecule, such as transcription or replication. Antisense molecules can be designed based on the 
sequence of the target molecule. Numerous methods for optimization of antisense efficiency by 
finding the most accessible regions of the target molecule exist. Exemplary methods would be in 
vitro selection experiments and DNA modification studies using DMS and DEPC. It is preferred 
that antisense molecules bind the target molecule with a dissociation constant (k^less than 10* 6 . It 

15 is more preferred that antisense molecules bind with a k d less than 10" 8 . It is also more preferred 
that the antisense molecules bind the target molecule with a k d less than 10" 10 . It is also preferred 
that the antisense molecules bind the target molecule with a k d less than 10" 12 . A representative 
sample of methods and techniques which aid in the design and use of antisense molecules can be 
found in the following non-limiting list of United States patents: 5,135,917, 5,294,533, 5,627,158, 

20 5,641,754, 5,691,317, 5,780,607, 5,786,138, 5,849,903, 5,856,103, 5,919,772, 5,955,590, 
5,990,088, 5,994,320, 5,998,602, 6,005,095, 6,007,995, 6,013,522, 6,017,898, 6,018,042, 
6,025,198, 6,033,910, 6,040,2,96, 6,046,004, 6,046,319, and 6,057,437. 

Aptamers are molecules that interact with a target molecule, preferably in a specific way. 
Typically aptamers are small nucleic acids ranging from 15-50 bases in length that fold into 

25 defined secondary and tertiary structures, such as stem-loops or G-quartets. Aptamers can bind 
small molecules, such as ATP (United States patent 5,631,146) and theophiline (United States 
patent 5,580,737), as well as large molecules, such as reverse transcriptase (United States patent 
5,786,462) and thrombin (United States patent 5,543,293). Aptamers can bind very tightly with 
kaS from the target molecule of less than 10* ,: M. It is preferred that the aptamers bind the target 

30 molecule with a k u less than IO -6 . It is more preferred that the aptamers bind the target molecule 
with a k a less than 10" 8 . It is also more preferred that the aptamers bind the target molecule with a 
ka less than I0* to . It is also preferred that the aptamers bind the target molecule with a less than 
10~'\ Aptamers can bind the target molecule with a very high degree of specificity. For example, 
aptamers have been isolated that have greater than a 1 0000 fold difference in binding affinities 

35 between the target molecule and another molecule that differ at only a single position on the 

molecule (United States patent 5,543,293). It is preferred that the aptamer have a k d with the target 
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molecule at least 10 fold lower than the k d with a background binding molecule. It is more 
preferred that the aptamer have a k d with the target molecule at least 100 fold lower than the kj 
with a background binding molecule. It is more preferred that the aptamer have a lq with the 
target molecule at least 1000 fold lower than the k d with a background binding molecule. It is 
5 preferred that the aptamer have a kj with the target molecule at least 10000 fold lower than the k d 
with a background binding molecule. It is preferred when doing the comparison for a polypeptide 
for example, that the background molecule be a different polypeptide. For example, when 
determining the specificity of beta4 integrin aptamers, the background protein could be bovine 
serum albumin. Representative examples of how to make and use aptamers to bind a variety of 
10 different target molecules can be found in the following non-limiting list of United States patents: 
5,476,766, 5,503,978, 5,631,146, 5,731,424 , 5,780,228, 5,792,613, 5,795,721, 5,846,713, 
5,858,660 , 5,861,254, 5,864,026, 5,869,641, 5,958,691, 6,001,988, 6,011,020, 6,013,443, 
6,020,130, 6,028,186, 6,030,776, and 6,051,698. 

Ribozymes are nucleic acid molecules that are capable of catalyzing a chemical reaction, 

15 either intramolecularly or intermolecularly. Ribozymes are thus catalytic nucleic acid. It is 
preferred that the ribozymes catalyze intermolecular reactions. There are a number of different 
types of ribozymes that catalyze nuclease or nucleic acid polymerase type reactions which are 
based on ribozymes found in natural systems, such as hammerhead ribozymes, (for example, but 
not limited to the following United States patents: 5,334,711, 5,436,330, 5,616,466, 5,633,133, 

20 5,646,020, 5,652,094, 5,712,384, 5,770,715, 5,856,463, 5,861,288, 5,891,683, 5,891,684, 

5,985,621, 5,989,908, 5,998,193, 5,998,203, WO 9858058 by Ludwig and Sproat, WO 9858057 by 
Ludwig and Sproat, and WO 9718312 by Ludwig and Sproat) hairpin ribozymes (for example, but 
not limited to the following United States patents: 5,631,1 15, 5,646,031, 5,683,902, 5,712,384, 
5,856,188, 5,866,701, 5,869,339, and 6,022,962), and tetrahymena ribozymes (for example, but not 

25 limited to the following United States patents: 5,595,873 and 5,652,107). There are also a number 
of ribozymes that are not found in natural systems, but which have been engineered to catalyze 
specific reactions de novo (for example, but not limited to the following United States patents: 
5,580,967, 5,688,670, 5,807,718, and 5,910,408). Preferred ribozymes cleave RNA or DNA 
substrates, and more preferably cleave RNA substrates. Ribozymes typically cleave nucleic acid 

30 substrates through recognition and binding of the target substrate with subsequent cleavage. This 
recognition is often based mostly on canonical or non-canonical base pair interactions. This 
property makes ribozymes particularly good candidates for target specific cleavage of nucleic 
acids because recognition of the target substrate is based on the target substrates sequence. 
Representative examples of how to make and use ribozymes to catalyze a variety of different 

35 reactions can be found in the following non-limiting list of United States patents: 5,646,042, 
5,693,535, 5,731,295, 5,81 1,300, 5,837,855, 5,869,253, 5,877,021, 5,877,022, 5,972,699, 
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5,972,704, 5,989,906, and 6,017,756. 

Triplex forming functional nucleic acid molecules are molecules that can interact with 
either double-stranded or single-stranded nucleic acid. When triplex molecules interact with a 
target region, a structure called a triplex is formed, in which there are three strands of DNA 
5 forming a complex dependant on both Watson-Crick and Hoogsteen base-pairing. Triplex 

molecules are preferred because they can bind target regions with high affinity and specificity. It 
is preferred that the triplex forming molecules bind the target molecule with a 1q less than 10" & . It 
is more preferred that the triplex forming molecules bind with a less than 10" 8 . It is also more 
preferred that the triplex forming molecules bind the target moelcule with a kj less than 10" 10 . It is 
10 also preferred that the triplex forming molecules bind the target molecule with a k d less than 10" u . 
Representative examples of how to make and use triplex forming molecules to bind a variety of 
different target molecules can be found in the following non-limiting list of United States patents: 
5,176,996, 5,645,985, 5,650,316, 5,683,874, 5,693,773, 5,834,185, 5,869,246, 5,874,566, and 
5,962,426. 

15 External guide sequences (EGSs) are molecules that bind a target nucleic acid molecule 

forming a complex, and this complex is recognized by RNase P, which cleaves the target 
molecule. EGSs can be designed to specifically target a RNA molecule of choice. RNAse P aids 
in processing transfer RNA (tRNA) within a cell Bacterial RNAse P can be recruited to cleave 
virtually any RNA sequence by using an EGS that causes the target RNA:EGS complex to mimic 

20 the natural tRNA substrate. (WO 92/03566 by Yale, and Forster and Altman, Science 238:407- 
409(1990)). 

Similarly, eukaryotic EGS/RNAse P-directed cleavage of RNA can be utilized to cleave 
desired targets within eukaryotic cells. (Yuan etah, Proc. Natl. Acad. Sci. USA 89:8006-8010 
(1992); WO 93/22434 by Yale; WO 95/24489 by Yale; Yuan and Altman, EMBOJ 14: 1 59-168 
25 (1995), and Carrara etaL Proc. Natl. Acad. Sci. fUSA) 92:2627-2631 (\ 995)1. Representative 
examples of how to make and use EGS molecules to facilitate cleavage of a variety of different 
target molecules be found in the following non-limiting list of United States patents: 5,168,053, 
5,624,824, 5,683,873, 5,728,521, 5,869,248, and 5,877,162 

(c) Small molecules 

30 Small molecules can also be used to regulate, for example, the function of the disclosed 

intcgrins, integrin receptors, molecules that interact with the disclosed integrin receptors, and 
molecules in the signaling pathways of the disclosed integrin receptors. Those of skill in the art 
understand how to generate small molecules of this type, and exemplary libraries and methods for 
isolating small molecule regulators are disclosed herein. 
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b) Compositions identified bv screening with disclosed 
compositions / combinatorial chemistry 

(1) Combinatorial chemistry 

The disclosed compositions can be used as targets for any combinatorial technique to 

5 identify molecules or macromolecular molecules that interact with the disclosed compositions, 

such as beta4 integrin, alpha6 integrin, or the gamma2 subunit of lamininS, in a desired way. The 

nucleic acids, peptides, and related molecules disclosed herein can be used as targets for the 

combinatorial approaches. Also disclosed are the compositions that are identified through 

combinatorial techniques or screening techniques in which the herein disclosed compositions, for 

10 example set forth in SEQ ID NOS:l-19 or portions thereof, are used as the target or reagent in a 

combinatorial or screening protocol. 

It is understood that when using the disclosed compositions in combinatorial techniques or 
screening methods, molecules, such as macromolecular molecules, will be identified that have 
particular desired properties such as inhibition or stimulation or the target molecule's function. 
15 The molecules identified and isolated when using the disclosed compositions, such as, beta4 

integrin, alpa6 integrin, or the gamma2 subunit of lamininS, are also disclosed. Thus, the products 
produced using the combinatorial or screening approaches that involve the disclosed compositions, 
such as, beta4 integrin, alpha6 integrin, or the gamma2 subunit of lamininS, are also considered 
herein disclosed. 

20 Combinatorial chemistry includes but is not limited to all methods for isolating small 

molecules or macromolecules that are capable of binding either a small molecule or another 
macromolecule, typically in an iterative process. Proteins, oligonucleotides, and sugars are 
examples of macromolecules. For example, oligonucleotide molecules with a given function, 
catalytic or ligand-binding, can be isolated from a complex mixture of random oligonucleotides in 

25 what has been referred to as "in vitro genetics" (Szostak, TIBS 19:89, 1992). One synthesizes a 
large pool of molecules bearing random and defined sequences and subjects that complex mixture, 
for example, approximately 10 15 individual sequences in 100 ug of a 100 nucleotide RNA, to some 
selection and enrichment process. Through repeated cycles of affinity chromatography and PCR 
amplification of the molecules bound to the ligand on the solid support, Ellington and Szostak 

30 (1990) estimated that I in 10 10 RNA molecules folded in such a way as to bind a small molecule 
dyes. DNA molecules with such ligand-binding behavior have been isolated as well (Ellington and 
Szostak, 1992; Bock et al, 1992). Techniques aimed at similar goals exist |or small organic 
molecules, proteins, antibodies and other macromolecules known to those |f skill in the art. 
Screening sels of molecules for a desired activity whether based on small organic libraries, 

35 oligonucleotides, or antibodies is broadly referred to as combinatorial chemistry. Combinatorial 
techniques arc particularly suited for defining binding interactions between* molecules and for 
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isolating molecules that have a specific binding activity, often called aptamers when the 
macromolecules are nucleic acids. 

There are a number of methods for isolating proteins which either have de novo activity or 
a modified activity. For example, phage display libraries have been used to isolate numerous 
5 peptides that interact with a specific target. (See for example, United States Patent No. 6,031,071; 
5,824,520; 5,596,079; and 5,565,332 which are herein incorporated by reference at least for their 
material related to phage display and methods relate to combinatorial chemistry) 

A preferred method for isolating proteins that have a given function is described by 
Roberts and Szostak (Roberts R.W. and Szostak J.W. Proc. Natl. Acad. Sci. USA, 

10 94(23)12997-302 (1997). This combinatorial chemistry method couples the functional power of 
proteins and the genetic power of nucleic acids. An RNA molecule is generated in which a 
puromycin molecule is covalently attached to the 3 '-end of the RNA molecule. An in vitro 
translation of this modified RNA molecule causes the correct protein, encoded by the RNA to be 
translated. In addition, because of the attachment of the puromycin, a peptdyl acceptor which 

15 cannot be extended, the growing peptide chain is attached to the puromycin which is attached to 
the RNA. Thus, the protein molecule is attached to the genetic material that encodes it. Normal in 
vitro selection procedures can now be done to isolate functional peptides. Once the selection 
procedure for peptide function is complete traditional nucleic acid manipulation procedures are 
performed to amplify the nucleic acid that codes for the selected functional peptides. After 

20 amplification of the genetic material, new RNA is transcribed with puromycin at the 3 '-end, new 
peptide is translated and another functional round of selection is performed. Thus, protein 
selection can be performed in an iterative manner just like nucleic acid selection techniques. The 
peptide which is translated is controlled by the sequence of the RNA attached to the puromycin. 
This sequence can be anything from a random sequence engineered for optimum translation (i.e. 

25 no stop codons etc.) or it can be a degenerate sequence of a known RNA molecule to look for 
improved or altered function of a known peptide. The conditions for nucleic acid amplification 
and in vitro translation are well known to those of ordinary skill in the art and are preferably 
performed as in Roberts and Szostak (Roberts R.W. and Szostak J.W. Proc. Natl. Acad. Sci. 
USA, 94(23)12997-302 (1997)). 

30 Another preferred method for combinatorial methods designed to isolate peptides is 

described in Cohen et al. (Cohen B.A.,et ah, Proc. Natl. Acad. Sci. USA 95(24); 14272-7 
(1998)). This method utilizes and modifies two-hybrid technology. Yeast two-hybrid systems are 
useful for the detection and analysis of protein:protein interactions. The two-hybrid system, 
initially described in the yeast Saccharomyces cerevisiae, is a powerful molecular genetic 

35 technique for identifying new regulatory molecules, specific to the protein of interest (Fields and 
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Song, Nature 340:245-6 (1989)). Cohen ct al., modified this technology so that novel interactions 
between synthetic or engineered peptide sequences could be identified which bind a molecule of 
choice. The benefit of this type of technology is that the selection is done in an intracellular 
environment. The method utilizes a library of peptide molecules that attached to an acidic 
5 activation domain. A peptide of choice, for example a portion of beta4 or alpha6 or gamma2 or 
laminin5 is attached to a DNA binding domain of a transcriptional activation protein, such as Gal 
4. By performing the Two-hybrid technique on this type of system, molecules that bind the 
portion of beta4 or alpha6 or gamma2 or lamintnS can be identified. 

There are molecules that can act like antibodies, in that they can having varying binding 
10 specificities, that are based on a fibronectin motif. The fibronectin type III domain (FN3) is a 
small autonomous folding unit. This FN3 domain can be found in numeorus proteins that bind 
ligand, such as animal proteins. The beta-sandwich structure of FN3 closely resembles that of 
immunoglobulin domains. FN3 mutants can be isolated using combinatorial approaches disclosed 
herein, for example phage display, that bind desired targets. Typically the libraries of FN3 
15 molecules have been randomized in the two surface loops. Thus, FN3 can be used at least as a 
scaffold for engineering novel binding proteins. (Koide A, Bailey CW, Huang X, Koide S., "The 
fibronectin type III domain as a scaffold for novel binding proteins. "J Mol Biol 1998: 284,1 141- 
1151 which is herein incorporated by reference at least for material related to the fibronectin based 
novel binding proteins). 

20 Using methodology well known to those of skill in the art, in combination with various 

combinatorial libraries, one can isolate and characterize those small molecules or macromolecules, 
which bind to or interact with the desired target The relative binding affinity of these compounds 
can be compared and optimum compounds identified using competitive binding studies, which are 
well known to those of skill in the art. 

25 Techniques for making combinatorial libraries and screening combinatorial libraries to 

isolate molecules which bind a desired target are well, known to those of skill in the art. 
Representative techniques and methods can be found in but are not limited to United States patents 
5,084,824, 5,288,514, 5,449,754, 5,506,337, 5,539,083, 5,545,568, 5,556,762, 5,565,324, 
5,565,332, 5,573,905, 5,618,825, 5,619,680, 5,627,210, 5,646,285, 5,663,046, 5,670,326, 

30 5,677,195, 5,683,899, 5,688,696, 5,688,997, 5,698,685, 5,712,146, 5,721,099, 5,723,598, 
5,741,713, 5,792,431, 5,807,683, 5,807,754, 5,821,130, 5,831,014, 5,834,195, 5,834,318, 
5,834,588, 5,840,500, 5,847,150, 5,856,107, 5,856,496, 5,859,190, 5,864,010, 5,874,443, 
5,877,214, 5,880,972, 5,886,126, 5,886,127, 5,891,737, 5,916,899, 5,919,955, 5,925,527, 
5,939,268, 5,942,387, 5,945,070, 5,948,696, 5,958,702, 5,958,792, 5,962,337, 5,965,719, 

35 5,972,719, 5,976,894, 5,980,704, 5,985,356, 5,999,086, 6,001,579, 6,004,617 : 6,008,321, 
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6,017,768, 6,025,371, 6,030,917, 6,040,193, 6,045,671, 6,045,755, 6,060,596, and 6,061,636. 

Combinatorial libraries can be made from a wide array of molecules using a number of 
different synthetic techniques. For example, libraries containing fused 2,4-pyrimidinediones 
(United States patent 6,025,371) dihydrobenzopyrans (United States Patent 6,01 7,768and 
5 5,821,130), amide alcohols (United States Patent 5,976,894), hydroxy-amino acid amides (United 
States Patent 5,972,719) carbohydrates (United States patent 5,965,719), l,4-benzodiazepin-2,5- 
diones (United States patent 5,962,337), cyclics (United States patent 5,958,792), biaryl amino 
acid amides (United States patent 5,948,696), thiophenes (United States patent 5,942,387), 
tricyclic Tetrahydroquinolines (United States patent 5,925,527), benzofurans (United States patent 

10 5,919,955), isoquinolines (United States patent 5,916,899), hydantoin and thiohydantoin (United 
States patent 5,859,190), indoles (United States patent 5,856,496), imidazol-pyrido-indole and 
imidazol-pyrido-benzothiophenes (United States patent 5,856,107) substituted 2-methylene-2, 3- 
dihydrothiazoles (United States patent 5,847,150), quinolines (United States patent 5,840,500), 
PNA (United States patent 5,831,014), containing tags (United States patent 5,721,099), 

15 polyketides (United States patent 5,712,146), morpholino-subunits (United States patent 
5,698,685 and 5,506,337), sulfamides (United States patent 5,6 18,825), and benzodiazepines 
(United States patent 5,288,5 14). 

Screening molecules similar to alpha6 for inhibition of alpha6beta4 formation is a method 
of identifying and isolating desired compounds that can inhibit the formation of A6B4 receptor. 

20 For example, the disclosed compositions, such as alpha6 integrin or beta4 integrin can be used as 
targets in a selection scheme disclosed herein, and then the counter part integrin could be used as a 
competitive inhibitor to isolate the desired molecules. For example, a library of molecules could 
be incubated with beta4 integrin, which is bound to a solid support. The solid support can be 
washed to remove the unbound molecules and then the solid support can be incubated with, for 

25 example, alpha6 integrin at a concentration that will saturate all beta4 binding sites. The 

molecules which are collected in the flowthrough after washing the solid support will be enriched 
for molecules that interact with beta4 integrin in a way that is competitive to the alpha6-beta4 
interaction. Likewise, the solid support, bound with a target integrin, or more preferably a target 
integrin receptor, such as alpha6beta4 receptor, could also be washed, with for example, lamininS 

30 or the gamma2 subunit of laminin5 at a concentration that will saturate all of the gamma2 binding 
sites on the beta4 integrin. Collection of the wash under these conditions will yield a population of 
molecules enriched for molecules that competitively interact with beta4 integrin at the beta4- 
gamma2 site. Another example, is the following; bind target to solid support on microtiter plate. 
Incubate with ligand in presence of gridded subset of library members (or single compounds), 

35 wash, identify competitor by reduction of ligand binding It is understood that the exemplary 
discussions of alpha6beta4 and/or beta4 are equally applicable to alpha6, as well as other 
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alpha6betax receptors, such alpha6betal. 

Also disclosed are methods of isolating molecules that bind with a target molecule 
selected from the group consisting, B4 integrin, alpha6 integrin, and the gamma2 subunit of 
lamininS comprising 1) contacting a library of molecules with the target molecule and 2) 
5 collecting molecules that bind the target molecule producing an enriched population of molecules. 

Disclosed are methods, further comprising the step of repeating steps 1 and 2 with the 
enriched population of molecules, and/or wherein the library comprises a small molecule, peptide, 
peptide mimetic, or oligonucleotide. 

As used herein combinatorial methods and libraries included traditional screening methods 
10 and libraries as well as methods and libraries used in iterative processes. 

(2) Computer assisted design 
The disclosed compositions can be used as targets for any molecular modeling technique 
to identify either the structure of the disclosed compositions or to identify potential or actual 
molecules, such as small molecules, which interact in a desired way with the disclosed 
15 compositions. The nucleic acids, peptides, and related molecules disclosed herein can be used as 
targets in any molecular modeling program or approach. 

It is understood that when using the disclosed compositions in modeling techniques, 
molecules, such as macromolecular molecules, will be identified that have particular desired 
properties such as inhibition or stimulation or the target molecule's function. The molecules 
20 identified and isolated when using the disclosed compositions, such as, beta4, are also disclosed. 
Thus, the products produced Using the molecular modeling approaches that involve the disclosed 
compositions, such as, beta4 integrin, alpha6 integrin, or lamininS, are also considered herein 
disclosed. 

Thus, one way to isolate molecules that bind a molecule of choice is through rational 
25 design. This is achieved through structural information and computer modeling. Computer 
modeling technology allows visualization of the three-dimensional atomic structure of a selected 
molecule and the rational design of new compounds that will interact with the molecule. The 
three-dimensional construct typically depends on data from x-ray crystallographic analyses or 
NMR imaging of the selected molecule. The molecular dynamics require force field data. The 
30 computer graphics systems enable prediction of how a new compound will link to the target 
molecule and allow experimental manipulation of the structures of the compound and target 
molecule to perfect binding specificity. Prediction of what the molecule-compound interaction 
will be when small changes are made in one or both requires molecular mechanics software and 
computationally intensive computers, usually coupled with user- friendly, menu-driven interfaces 
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between the molecular design program and the user. 

Examples of molecular modeling systems are the CHARMm and QUANTA programs, 
Polygen Corporation, Waltham, MA. CHARMm performs the energy minimization and molecular 
dynamics functions. QUANTA performs the construction, graphic modeling and analysis of 
5 molecular structure. QUANTA allows interactive construction, modification, visualization, and 
analysis of the behavior of molecules with each other. 

A number of articles review computer modeling of drugs interactive with specific proteins, 
such as Rotivinen, et al., 1 988 Acta Pharmaceutical Fennica 97, 159-166; Ripka, New Scientist 54- 
57 (June 16, 1988); McKinaly and Rossmann, 1989 Amiu. Rev. Pharmacol Toxicioi 29, 111- 

10 1 22; Perry and Davies, OSAR: Quantitative Structure- Activity Relationships in Drue Desien pp. 
189-193 (Alan R. Liss, Inc. 1989); Lewis and Dean, 1989 Proc. R. Soc. Lond. 236, 125-140 
and 141-162; and, with respect to a model enzyme for nucleic acid components, Askew, et al., 
19897. Am. Chem. Soc. 111,1082-1090. Other computer programs that screen and graphically 
depict chemicals are available from companies such as BioDesign, Inc., Pasadena, CA., Allelix, 

15 Inc, Mississauga, Ontario, Canada, and Hypercube, Inc., Cambridge, Ontario. Although these are 
primarily designed for application to drugs specific to particular proteins, they can be adapted to 
design of molecules specifically interacting with specific regions of DNA or RNA, once that 
region is identified. 

Although described above with reference to design and generation of compounds which 
20 could alter binding, one could also screen libraries of known compounds, including natural 
products or synthetic chemicals, and biologically active materials, including proteins, for 
compounds which alter substrate binding or enzymatic activity. 

c) Nucleic acids 

There are a variety of molecules disclosed herein that are nucleic acid based, including for 
25 example the nucleic acids that encode, for example beta4 and the gamrna2 subunit of laminin5, as 
well as various functional nucleic acids. The disclosed nucleic acids are made up of for example, 
nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and 
other molecules are discussed herein. It is understood that for example, when a vector is expressed 
in a cell, that the expressed mRNA will typically be made up of A, C, G, and U. Likewise, it is 
30 understood that if, for example, an antisense molecule is introduced into a cell or cell environment 
through for example exogenous delivery, it is advantageous that the antisense molecule be made 
up of nucleotide analogs that reduce the degradation of the antisense molecule in the cellular 
environment. 

(1) Nucleotides and related molecules 
35 A nucleotide is a molecule that contains a base moiety, a sugar moiety and a phosphate 
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moiety. Nucleotides can be linked together through their phosphate moieties and sugar moieties 
creating an internucleoside linkage. The base moiety of a nucleotide can be adenin-9-yl (A), 
cytosin-l-yl (C), guanin-9-yl (G), uracil-l-yl (U), and thymin-l-yl (T). The sugar moiety of a 
nucleotide is a ribose or a deoxyribose. The phosphate moiety of a nucleotide is pentavalent 
5 phosphate. An non-limiting example of a nucleotide would be 3 -AMP (3'-adenosine 
monophosphate) or 5'-GMP (5-guanosine monophosphate). 

A nucleotide analog is a nucleotide which contains some type of modification to either the 
base, sugar, or phosphate moieties. Modifications to the base moiety would include natural and 
synthetic modifications of A, C, G, and T/U as well as different purine or pyrimidine bases, such 

10 as uracil-5-yl (.psi.), hypoxanthin-9-yl (I), and 2-aminoadenin-9-yl. A modified base includes but 
is not limited to 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 
2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other 
alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5- 
halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5- 

15 uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8- 
substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5- 
substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8- 
azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine, 
Additional base modifications can be found for example in U.S. Pat. No. 3,687,808, Englischet 

20 al., Angewandte Chemie, International Edition, 1991, 30, 613, and Sanghvi, Y. S., Chapter 15, 
Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B. ed., CRC 
Press, 1993. Certain nucleotide analogs, such as 5 -substituted pyrimidines, 6-azapyrimidines and 
N-2, N-6 and 0-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5- 
propynylcytosine. 5-methylcytosine can increase the stability of duplex formation. Base 

25 modifications can be combined, for example, with a sugar modification, such as 2'-0- 
methoxyethyl, to achieve unique properties such as increased duplex stability. There are 
numerous United States patents such as 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 
5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 
5,594,121, 5,596,091; 5,614,617; and 5,681,941, which detail and describe a range of base 

30 modifications. Each of these patents is herein incorporated by reference. 

Nucleotide analogs can also include modifications of the sugar moiety. Modifications to 
the sugar moiety would include natural modifications of the ribose and deoxy ribose as well as 
synthetic modifications. Sugar modifications include but are not limited to the following 
modifications at the 2' position; OH-; F-; 0-, S-, or N-alkyl; 0-, S-, or N-alkenyl; 0-, S- or N- 
35 alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or 

unsubstituted C, to C, 0f alkyl or C 2 to C, 0 alkenyl and alkynyl. T sugar modifications also include 
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but are not limited to -0[(CH 2 ) n 0] m CH 3 , -OCCH^ OCH 3 , -0(CH 2 ) n NH 2 , -0(CHJ n CH 3 , - 
0(CH 2 ) n -ONH 2 , and -0(CH : ) n ON[(CH 2 ) n CH 3 )] 2 , where n and m are from I to about 10. 

Other modifications at the 2' position include but are not limited to: C, to C )0 lower alky], 
substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or 0-aralky! 3 SH, SCH 3 , OCN, CI, Br, CN, CF 3 , 
5 OCF 3 , SOCH 3 , S0 2 CH 3 , ON0 2 , NO : , N 3 , NH 2 , heterocycloalkyl, heterocycloalkaryl, 

aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an 
intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a 
group for improving the pharmacodynamic properties of an oligonucleotide, and other substiruents 
having similar properties. Similar modifications may also be made at other positions on the sugar, 

10 particularly the 3' position of the sugar on the 3' terminal nucleotide or in 2'-5' linked 

oligonucleotides and the 5' position of 5' terminal nucleotide. Modified sugars would also include 
those that contain modifications at the bridging ring oxygen, such as CH 2 and S. Nucleotide sugar 
analogs may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl 
sugar. There are numerous United States patents that teach the preparation of such modified sugar 

15 structures such as 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 
5,514,785; 5,519,134; 5,567,81 1; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 
5,639,873; 5,646,265; 5,658,873; 5,670,633; and 5,700,920, each of which is herein incorporated 
by reference in its entirety. 

Nucleotide analogs can also be modified at the phosphate moiety. Modified phosphate 
20 moieties include but are not limited to those that can be modified so that the linkage between two 
nucleotides contains a phosphorothioate, chiral phosphorothioate, phosphorodithioate, 
phosphotriester, aminoalkylphosphotriester, methyl and other alkyl phosphonates including 3'- 
alkylene phosphonate and chiral phosphonates, phosphinates, phosphoramidates including 3- 
amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, 
25 thionoalkylphosphonates, thionoalkylphosphotriesters, and borahophosphates. It is understood 
that these phosphate or modified phosphate linkage between two nucleotides can be through a 3'-5' 
linkage or a 2'-5' linkage, and the linkage can contain inverted polarity such as 3'-5' to 5*-3* or 2'-5' 
to 5'-2\ Various salts, mixed salts and free acid forms are also included. Numerous United States 
patents teach how to make and use nucleotides containing modified phosphates and include but are 
30 not limited to, 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 
5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 
5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,1 1 1; 5,563,253; 5,571,799; 
5,587,361; and 5,625,050, each of which is herein incorporated by reference. 

It is understood that nucleotide analogs need only contain a single modification, but may 
35 also contain multiple modifications within one of the moieties or between different moieties* 
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Nucleotide substitutes are molecules having similar functional properties to nucleotides, 
but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). Nucleotide 
substitutes are molecules that will recognize nucleic acids in a Watson-Crick or Hoogsteen 
manner, but which are linked together through a moiety other than a phosphate moiety. 
5 Nucleotide substitutes are able to conform to a double helix type structure when interacting with 
the appropriate target nucleic acid. 

, Nucleotide substitutes are nucleotides or nucleotide analogs that have had the phosphate 
moiety and/or sugar moieties replaced. Nucleotide substitutes do not contain a standard 
phosphorus atom. Substitutes for the phosphate can be for example, short chain alkyl or 

10 cycioalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl intemucleoside 
linkages, or one or more short chain heteroatomic or heterocyclic intemucleoside linkages. These 
include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); 
siloxane backbones; sulfide, sulfoxide and sulfone backbones;formacetyl and thioformacetyl 
backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; 

15 sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and 

sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH 2 component 
parts. Numerous United States patents disclose how to make and use these types of phosphate 
replacements and include but are not limited to 5,034,506; 5,166,315; 5,185,444; 5,214,134; 
5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 

20 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 

5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which 
is herein incorporated by reference. 

It is also understood in a nucleotide substitute that both the sugar and the phosphate 
moieties of the nucleotide can be replaced, by for example an amide type linkage 
25 (aminoethylglycine) (PNA). United States patents 5,539,082; 5,714,331;and 5,719,262 teach how 
to make and use PNA molecules, each of which is herein incorporated by reference. (See also 
Nielsen et al., Science, 1991, 254, 1497-1500). 

It is also possible to link other types of molecules (conjugates) to nucleotides or nucleotide 
analogs to enhance for example, cellular uptake. Conjugates can be chemically linked to the 
30 nucleotide or nucleotide analogs. Such conjugates include but are not limited to lipid moieties 
such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 

86, 6553-6556), chotic acid (Manoharan et al., Bioorg. Med. Chem. Let, 1994, 4, 1053- 
1060), a thioether, e.g., hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660, 
306-309; Manoharan et al., Bioorg. Med. Chem. Let, 1993, 3, 2765-2770), a thiocholesterol 
35 (Oberhauscr et at, Nuct Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., dodecandiol or 
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undecyl residues (Saison-Behmoaras etal., EMBO J., 1991, 10, 1111-1118; Kabanov et al., FEBS 
Lett, 1990, 259, 327-330; Svinarchuk et a!., Biochimie, 1993, 75, 49-54), a phospholipid, e.g., di- 
hexadecyl-rac-glycerot or triethylammonium l,2-di-0-hexadecyl-rac-glycero-3-H-phosphonate 
(Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654; Shea etal., Nucl. Acids Res., 1990, 
5 18, 3777-3783), a polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & 
Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 
1995,36, 3651-3654), a palmityl moiety (Mishra etal., Biochim. Biophys. Acta, 1995, 1264, 
229-237), or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety (Crooke et al., J. 
Pharmacol. Exp. Then, 1996, 277, 923-937. Numerous United States patents teach the 

10 preparation of such conjugates and include, but are not limited to U.S. Pat. Nos. 4,828,979; 
4,948,882; 5,218,105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 
5,580,731; 5,591,584; 5,109,124; 5,118,802; 5,138,045; 5,414,077; 5,486,603; 5,512,439; 
5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762,779; 4,789,737; 4,824,941; 
4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,1 12,963; 5,214,136; 5,082,830; 

15 5,112,963; 5,214,136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 
5,317,098; 5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 
5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 
5,599,928 and 5,688,941, each of which is herein incorporated by reference. 

It is understood that an oligonucleotide can be made from any combination of nucleotides, 
20 nucleotide analogs, or nucleotide substitutes disclosed herein or related molecules not specifically 
recited herein. 

A Watson-Crick interaction is at least one interaction with the Watson-Crick face of a 
nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of a nucleotide, 
nucleotide analog, or nucleotide substitute includes the C2, Nl, and C6 positions of a purine based 
25 nucleotide, nucleotide analog, or nucleotide substitute and the C2, N3, C4 positions of a 
pyrimidine based nucleotide, nucleotide analog, or nucleotide substitute. 

A Hoogsteen interaction is the interaction that takes place on the Hoogsteen face of a 
nucleotide or nucleotide analog, which is exposed in the major groove of duplex DNA. The 
Hoogsteen face includes the N7 position and reactive groups (NH2 or O) at the C6 position of 
30 purine nucleotides. 

(2) Sequences 

There are a variety of sequences related to the beta4 integrin or the Iaminin5-gamma2 gene 
or the alpha6 integrin gene having the following Genbank Accession Numbers 6453379, 4557674, 
and AH006634, respectively. These sequences and others are herein incorporated by reference in 
35 their entireties as well as for individual subsequences contained therein. 
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One particular sequence set forth in SEQ ID NO:l (beta4 integrin) and having Genbank 
accession number 6453379 is used herein, at various points, as an example, to exemplify the 
disclosed compositions and methods (or when another particular sequence is used as an example). 
It is understood that the description related to this sequence is applicable to any sequence related to 
5 beta4 integrin or any of the other molecules disclosed herein, such as alpha6 integrin, or the 
subunits of laminin5, such as gamma2, unless specifically indicated otherwise. Those of skill in 
the art understand how to resolve sequence discrepancies and differences and to adjust the 
compositions and methods relating to a particular sequence to other related sequences (i.e. 
sequences of beta4 integrin). Primers and/or probes can be designed for any beta4 integrin 
10 sequence given the information disclosed herein and known in the art. 

(3) Primers and probes 
Disclosed are compositions including primers and probes, which are capable of interacting 
with the for example, the alpha6 gene or rnRNA, beta4 gene or mRNA, or gamma2 subunit of the 
laminin5 ligand gene as disclosed herein or mRNA as wells as primers or probes for any of the 

15 sequences or fragments of the sequences, set forth in SEQ ID NOs: 1,3,5,7,9,1 1, and 13. In certain 
embodiments the primers are used to support DNA amplification reactions. Typically the primers 
will be capable of being extended in a sequence specific manner. Extension of a primer in a 
sequence specific manner includes any methods wherein the sequence and/or composition of the 
nucleic acid molecule to which the primer is hybridized or otherwise associated directs or 

20 influences the composition or sequence of the product produced by the extension of the primer. 
Extension of the primer in a sequence specific manner therefore includes, but is not limited to, 
PCR, DNA sequencing, DNA extension, DNA polymerization, RNA transcription, or reverse 
transcription. Techniques and conditions that amplify the primer in a sequence specific manner 
are preferred. In certain embodiments the primers are used for the DNA amplification reactions, 

25 such as PCR or direct sequencing. It is understood that in certain embodiments the primers can 
also be extended using non-enzymatic techniques, where for example, the nucleotides or 
oligonucleotides used to extend the primer are modified such that they will chemically react to 
extend the primer in a sequence specific manner. Typically the disclosed primers hybridize with 
the beta4 gene, alpha6 gene, or Iaminin5-gamma2 subunit gene, for example, or region of the 

30 beta4 gene, alpha6 gene, or Iaminin5-gamma2 subunit gene, for example, or they hybridize with 
the complement of the beta4 gene, alpha6 gene, or Iaminin5-gamma2 subunit gene, for example, or 
complement of a region of the beta4 gene, alpha6 gene, or Iaminin5-gamma2 subunit gene, or any 
of the sequences or fragments of the sequences, set forth in SEQ ID NOs: 1,3,5,7,9,1 1 , and 1 3, for 
example. 

35 The size of the primers or probes for interaction with the beta4 gene, alpha6 gene, or 

Iaminin5-gamma2 subunit gene, for example, in certain embodiments can be any size that supports 
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the desired enzymatic manipulation of the primer, such as DNA amplification or the simple 
hybridization of the probe or primer. A typical primer or probe for beta4 gene, alpha6 gene, or 
Iaminin5-gamma2 subunit gene, for example, or primer or probe for any of the sequences or 
fragments of the sequences, set forth in SEQ ID NOs: 1 ,3,5,7,9, 1 1 , and 13 would be at least about 
5 6,7,8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22,23,24,25,26,27,28,29,30,31,32,33, 
34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 5 1 , 52, 53, 54, 55, 56, 57, 58, 59, 
60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86,87, 88, 89,90,91,92, 93,94, 95, 96,97,98, 99, 100, 125, 150, 175,200, 225,250, 275, 300, 
325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 
10 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long. 

In other embodiments a primer or probe for beta4 gene, alpha6 gene, or Iaminin5-gamma2 
subunit gene, for example, primer or probe or a primer or probe for any of the sequences or 
fragments of the sequences, set forth in SEQ ID NOs: 1,3,5,7,9,11, and 13 can be less than or equal 
to about 6, 7, 8, 9, 10, 1 1, 12 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
15 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 
57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 
83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 
275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 
1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long. 

20 The primers for the beta4 gene, alpha6 gene, or Iaminin5-gamma2 subunit gene, or any of 

the sequences or fragments of the sequences, set forth in SEQ ID NOs:l,3,5,7,9,l 1, and 13, for 
example, typically will be used to produce an amplified DNA product that contains a desired 
region. In general, typically the size of the product will be such that the size can be accurately 
determined to within 3, or 2 or 1 nucleotides. 

25 In certain embodiments this product is at least about 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, 

30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 
56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 
250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 

30 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long. 

In other embodiments the product is less than or equal to about 20, 21 , 22, 23, 24, 25, 26, 
27. 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 
53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 
35 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 
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900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long. 

d) Sequence similarities 
It is understood that as discussed herein the use of the terms homology and identity mean 
the same thing as similarity. Thus, for example, if the word homology is used between two non- 
5 natural sequences it is understood that this is not necessarily indicating an evolutionary 
relationship between these two sequences, but rather is looking at the similarity or relatedness 
between their nucleic acid sequences. Many of the methods for determining homology between 
two evolutionarily related molecules are routinely applied to any two or more nucleic acids or 
proteins for the purpose of measuring sequence similarity regardless of whether or not they are 
10 evolutionarily related. 

In general, it is understood that one way to define any known variants and derivatives or 
those that might arise, of the disclosed genes and proteins herein, is through defining the variants 
and derivatives in terms of homology to specific known sequences. This identity of particular 
sequences disclosed herein is also discussed elsewhere herein. In general, variants of genes and 

15 proteins herein disclosed typically have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the 
stated sequence or the native sequence. Those of skill in the art readily understand how to 
determine the homology of two proteins or nucleic acids, such as genes. For example, the 
homology can be calculated after aligning the two sequences so that the homology is at its highest 

20 level. 

Another way of calculating homology can be performed by published algorithms. Optimal 
alignment of sequences for comparison may be conducted by the local homology algorithm of 
Smith and Waterman Adv. Appl. Math. 2: 482 (1 981), by the homology alignment algorithm of 
Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of 
25 Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized 

implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by 
inspection. 

The same types of homology can be obtained for nucleic acids by for example the 
30 algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger etal. Proc. Natl. Acad. ScL 
USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymoi 183:281-306, 1989 which are herein 
incorporated by reference for at least material related to nucleic acid alignment. It is understood 
that any of the methods typically can be used and that in certain instances the results of these 
various methods may differ, but the skilled artisan understands if identity is found with at least one 
35 of these methods, the sequences would be said to have the stated identity, and be disclosed herein. 
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For example, as used herein, a sequence recited as having a particular percent homology to 
another sequence refers to sequences that have the recited homology as calculated by any one or 
more of the calculation methods described above. For example, a first sequence has 80 percent 
homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 
5 percent homology to the second sequence using the Zuker calculation method even if the first 
sequence does not have 80 percent homology to the second sequence as calculated by any of the 
other calculation methods. As another example, a first sequence has 80 percent homology, as 
defined herein, to a second sequence if the first sequence is calculated to have 80 percent 
homology to the second sequence using both the Zuker calculation method and the Pearson and 

10 Lipman calculation method even if the first sequence does not have 80 percent homology to the 
second sequence as calculated by the Smith and Waterman calculation method, the Needleman and 
Wunsch calculation method, the Jaeger calculation methods, or any of the other calculation 
methods. As yet another example, a first sequence has 80 percent homology, as defined herein, to 
a second sequence if the first sequence is calculated to have 80 percent homology to the second 

15 sequence using each of calculation methods (although, in practice, the different calculation 
methods will often result in different calculated homology percentages). 

It is also understood that functional fragments as well as antigenic fragments as well as 
fragments that can be used in selection protocols of the disclosed compositions are also disclosed. 
For example, integrins have domains that interact with the other integrins. It may be advantageous 
20 in certain embodiments to utilize just the integrin binding domain fragment of, for example, the 
beta4 integrin, in a selection protocol disclosed herein. By using this domain of the beta4 integrin 
as the selection target, for example, the selection protocol will be biased for molecules binding this 
domain of beta4 integrin. 

e) Hybridization/selective hybridization 

25 The term hybridization typically means a sequence driven interaction between at least two 

nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven interaction means 
an interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives 
in a nucleotide specific manner. For example, G interacting with C or A interacting with T are 
sequence driven interactions. Typically sequence driven interactions occur on the Watson-Crick 

30 face or Hoogsteen face of the nucleotide. The hybridization of two nucleic acids is affected by a 
number of conditions and parameters known to those of skill in the art. For example, the salt 
concentrations, pH, and temperature of the reaction all affect whether two nucleic acid molecules 
will hybridize. 

Parameters for selective hybridization between two nucleic acid molecules are well known 
35 to those of skill in the art. For example, in some embodiments selective hybridization conditions 
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can be defined as stringent hybridization conditions. For example, stringency of hybridization is 
controlled by both temperature and salt concentration of either or both of the hybridization and 
washing steps. For example, the conditions of hybridization to achieve selective hybridization 
may involve hybridization in high ionic strength solution (6X SSC or 6X SSPE) at a temperature 
5 that is about 12-25°C below the Tm (the melting temperature at which half of the molecules 
dissociate from their hybridization partners) followed by washing at a combination of temperature 
and salt concentration chosen so that the washing temperature is about 5°C to 20°C below the Tm. 
The temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized to a labeled 

10 nucleic acid of interest and then washed under conditions of different stringencies. Hybridization 
temperatures are typically higher for DNA-RNA and KNA-RNA hybridizations. The conditions 
can be used as described above to achieve stringency, or as is known in the art. (Sarnbrook et al. f 
Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989; Kunkel etal. Methods Enzymol. 1987:154:367, 1987 which is herein 

IS incorporated by reference for material at least related to hybridization of nucleic acids). A 
preferable stringent hybridization condition for a DNA:DNA hybridization can be at about 68°C 
(in aqueous solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of 
hybridization and washing, if desired, can be reduced accordingly as the degree of 
complementarity desired is decreased, and further, depending upon the G-C or A-T richness of any 

20 area wherein variability is searched for. Likewise, stringency of hybridization and washing, if 
desired, can be increased accordingly as homology desired is increased, and further, depending 
upon the G-C or A-T richness of any area wherein high homology is desired, all as known in the 
art. 

Another way to define selective hybridization is by looking at the amount (percentage) of 
25 one of the nucleic acids bound to the other nucleic acid. For example, in some embodiments 
selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 
77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent 
of the limiting nucleic acid is bound to the non-limiting nucleic acid. Typically, the non-limiting 
primer is in for example, 10 or 100 or 1000 fold excess. This type of assay can be performed at 
30 under conditions where both the limiting and non-limiting primer are for example, 10 fold or 100 
fold or 1000 fold below their kj, or where only one of the nucleic acid molecules is 10 fold or 100 
fold or 1000 fold or where one or both nucleic acid molecules are above their Iq. 

Another way to define selective hybridization is by looking at the percentage of primer 
that gets enzymatically manipulated under conditions where hybridization is required to promote 
35 the desired enzymatic manipulation. For example, in some embodiments selective hybridization 
conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81. 82, 
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83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the primer is 
enzymatically manipulated under conditions which promote the enzymatic manipulation, for 
example if the enzymatic manipulation is DNA extension, then selective hybridization conditions 
would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
5 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the primer molecules are 
extended. Preferred conditions also include those suggested by the manufacturer or indicated in 
the art as being appropriate for the enzyme performing the manipulation. 

Just as with homology, it is understood that there are a variety of methods herein disclosed 
for determining the level of hybridization between two nucleic acid molecules. It is understood 
1 0 that these methods and conditions may provide di fferent percentages of hybridization between two 
nucleic acid molecules, but unless otherwise indicated meeting the parameters of any of the 
methods would be sufficient. For example if 80% hybridization was required and as long as 
hybridization occurs within the required parameters in any one of these methods it is considered 
disclosed herein. 

15 It is understood that those of skill in the art understand that if a composition or method 

meets any one of these criteria for determining hybridization either collectively or singly it is a 
composition or method that is disclosed herein. 

fi Delivery of the compositions to cells 
The disclosed compositions and methods often entail delivery of the compositions to cells. 
20 For example, antisense molecules directed to alpha6 mKNA or gamma2 mRNA can be delivered 
to cells via any method. A number of exemplary methods are disclosed herein. It is also 
understood that in certain embodiments, non-nucleic acid molecules will be and can be delivered 
to cells, for example an antibody to beta4 integrin, alpha6 integrin or, gamma2, or a small 
molecule, or a peptide. Delivery of these molecules can occur by any means, and exemplary 
25 compositions and methods for such delivery are disclosed herein. 

There are a number of compositions and methods which can be used to deliver nucleic 
acids to cells, either in vitro or in vivo. These methods and compositions can largely be broken 
down into two classes: viral based delivery systems and non-viral based delivery systems. For 
example, the nucleic acids can be delivered through a number of direct delivery systems such as, 

30 electroporation, lipofection, calcium phosphate precipitation, plasmids, viral vectors, viral nucleic 
acids, phage nucleic acids, phages, cosmids, or via transfer of genetic material in cells or carriers 
such as cationic liposomes. Appropriate means for Iransfection, including viral vectors, chemical 
transfectants, or physico-mechanical methods such as electroporation and direct diffusion of DNA, 
are described by, for example, Wolff, J. A., et al., Science, 247, 1465-1468, (1990); and Wolff, J, 

35 A. Nature, 352, 815-818, (1991). Such methods are well known in the art and readily adaptable for 
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use with the compositions and methods described herein. In certain cases, the methods will be 
modified to specifically function with large DNA molecules. Further, these methods can be used 
to target certain diseases and cell populations by using the targeting characteristics of the carrier 

(1) Nucleic acid based delivery systems 
5 Transfer vectors can be any nucleotide construction used to deliver genes into cells (e.g., a 

plasmid), or as part of a general strategy to deliver genes, e.g., as part of recombinant retrovirus or 
adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)). 

As used herein, plasmid or viral vectors are agents that transport the disclosed nucleic 
acids, such as gamma2 antisense producing molecules into the cell without degradation and 

10 include a promoter yielding expression of the gene in the cells into which it is delivered. In some 
embodiments the delivery vectors are derived from either a virus or a retrovirus. Viral vectors are, 
for example, Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS 
virus, neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HIV 
backbone. Also preferred are any viral families which share the properties of these viruses which 

1 5 make them suitable for use as vectors. Retroviruses include Murine Moloney Leukemia virus, 
MMLV, and retroviruses that express the desirable properties of MMLV as a vector. Retroviral 
vectors are able to carry a larger genetic payload, i.e., a transgene or marker gene, than other viral 
vectors, and for this reason are a commonly used vector. However, other than Lentivirus vectors , 
they are typically not as useful in non-proliferating cells. Adenovirus vectors are relatively stable 

20 and easy to work with, have high titers, and can be delivered in aerosol formulation, and can 
transfect non-dividing cells. Pox viral vectors are large and have several sites for inserting genes, 
they are thermostable and cari be stored at room temperature. A preferred embodiment is a viral 
vector which has been engineered so as to suppress the immune response of the host organism, 
elicited by the viral antigens. Preferred vectors of this type will carry coding regions for 

25 Interleukin 8 or 10. 

Viral vectors can have higher transaction (ability to introduce genes) abilities than 
chemical or physical methods to introduce genes into cells. Typically, viral vectors contain, 
nonstructural early genes, structural late genes, an RNA polymerase III transcript, inverted 
terminal repeats necessary for replication and encapsidation, and promoters to control the 

30 transcription and replication of the viral genome. When engineered as vectors, viruses typically 
have one or more of the early genes removed and a gene or gene/promoter cassette is inserted into 
the viral genome in place of the removed viral DNA. Constructs of this type can carry up to about 
8 kb of foreign genetic material. The necessary functions of the removed early genes are typically 
supplied by cell lines which have been engineered to express the gene products of the early genes 

35 in trans. 
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(a) Retroviral Vectors 
A retrovirus is an animal virus belonging to the virus family of Retroviridae, including any 
types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are described by Verma, 
LM., Retroviral vectors for gene transfer. In Microbiology- 1985, American Society for 
5 Microbiology, pp. 229-232, Washington, (1985), which is incorporated by reference herein. 
Examples of methods for using retroviral vectors for gene therapy are described in U.S. Patent 
Nos. 4,868,116 and 4,980,286; PCT applications WO 90/02806 and WO 89/07136; and Mulligan, 
(Science 260:926-932 (1993)); the teachings of which are incorporated herein by reference. 

A retrovirus is essentially a package which has packed into it nucleic acid cargo. The 

10 nucleic acid cargo carries with it a packaging signal, which ensures that the replicated daughter 
molecules will be efficiently packaged within the package coat, in addition to the package signal, 
there are a number of molecules which are needed in cis, for the replication, and packaging of the 
replicated virus. Typically a retroviral genome, contains the gag, pol, and env genes which are 
involved in the making of the protein coat It is the gag, pol, and env genes which are typically 

1 5 replaced by the foreign DNA that it is to be transferred to the target cell. Retrovirus vectors 
typically contain a packaging signal for incorporation into the package coat, a sequence which 
signals the start of the gag transcription unit, elements necessary for reverse transcription, 
including a primer binding site to bind the tRNA primer of reverse transcription, terminal repeat 
sequences that guide the switch of RNA strands during DNA synthesis, a purine rich sequence 5' to 

20 the 3' LTR that serve as the priming site for the synthesis of the second strand of DNA synthesis, 
and specific sequences near the ends of the LTRs that enable the insertion of the DNA state of the 
retrovirus to insert into the host genome. The removal of the gag, pol, and env genes allows for 
about 8 kb of foreign sequence to be inserted into the viral genome, become reverse transcribed, 
and upon replication be packaged into a new retroviral particle. This amount of nucleic acid is 

25 sufficient for the delivery of a one to many genes depending on the size of each transcript. It is 
preferable to include either positive or negative selectable markers along with other genes in the 
insert 

Since the replication machinery and packaging proteins in most retroviral vectors have 
been removed (gag, pol, and env), the vectors are typically generated by placing them into a 

30 packaging cell line. A packaging cell line is a cell line which has been transfected or transformed 
with a retrovirus that contains the replication and packaging machinery, but lacks any packaging 
signal. When the vector carrying the DNA of choice is transfected into these cell lines, the vector 
containing the gene of interest is replicated and packaged into new retroviral particles, by the 
machinery provided in cis by the helper cell. The genomes for the machinery are not packaged 

35 because they lack the necessary signals. 
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(b) Adenoviral Vectors 

The construction of replication-defective adenoviruses has been described (Berkner et al., 
J. Virology 61:1213-1220 (1987); MassieetaL, Mol. Cell. Biol. 6:2872-2883 (1986); Haj- 
Ahmadetal., J. Virology 57:267-274 (1986); Davidson etal., J. Virology 61:1226-1239 (1987); 
5 Zhang "Generation and identification of recombinant adenovirus by liposome-mediated 

transfection and PCR analysis" BioTechniques 15:868-872 (1993)). The benefit of the use of 
these viruses as vectors is that they are limited in the extent to which they can spread to other cell 
types, since they can replicate within an initial infected cell, but are unable to form new infectious 
viral particles. Recombinant adenoviruses have been shown to achieve high efficiency gene 

10 transfer after direct, in vivo delivery to airway epithelium, hepatocytes, vascular endothelium, 
CNS parenchyma and a number of other tissue sites (Morsy, J. Clin. Invest 92:1580-1586 
(1993); Kirshenbaum, J. Clin. Invest. 92:381-387 (1993); Roessler, J. Clin. Invest. 92:1085- 
1092 (1993); Mouliier, Nature Genetics 4:154-159 (1993); La Salle, Science 259:988-990 (1993); 
Gomez-Foix, J. Biol. Chem. 267:25129-25134 (1992); Rich, Human Gene Therapy 4:461-476 

15 (1993);Zabner, Nature Genetics 6:75-83 (1994); Guzman, Circulation Research 73:1201-1207 
(1993); Bout, Human Gene Therapy 5:3-10 (1994); Zabner, Cell 75:207-216 (1993); Caillaud, 
Eur. J. Neuroscience 5:1287-1291 (1993); and Ragot, J. Gen. Virology 74:501-507 (1993)). 
Recombinant adenoviruses achieve gene transduction by binding to specific cell surface receptors, 
after which the virus is internalized by receptor-mediated endocytosis, in the same manner as wild 

20 type or replication-defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970); 
Brown and Burlingham, J. Virology 12:386-396(1973); Svensson and Persson, J. Virology 
55:442-449 (1985); Seth, et al., J. Virol. 51:650-655 (1984); Seth, etal., Mol. Cell. Biol. 
4:1528-1533 (1984); Varga etal., J. Virology 65:6061-6070 (1991); Wickham et al., Cell 
73:309-319(1993)). 

25 A viral vector can be one based on an adenovirus which has had the El gene removed and 

these virons are generated in a cell line such as the human 293 cell line. In another preferred 
embodiment both the El and E3 genes are removed from the adenovirus genome. 

(c) Adeno-associated viral vectors 

Another type of viral vector is based on an adeno-associated virus (AAV). This defective 
30 parvovirus is a preferred vector because it can infect many cell types and is nonpathogenic to 
humans. AAV type vectors can transport about 4 to 5 kb and wild type AAV is known to stably 
insert into chromosome 19. Vectors which contain this site specific integration property are 
preferred. An especially preferred embodiment of this type of vector is the P4.1 C vector produced 
by Avigen, San Francisco, CA, which can contain the herpes simplex virus thymidine kinase gene, 
35 HS V-tk, and/or a marker gene, such as the gene encoding the green fluorescent protein, GFP. 
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In another type of AAV virus, the AAV contains a pair of inverted terminal repeats (ITRs) 
which flank at least one cassette containing a promoter which directs cell-specific expression 
operably linked to a heterologous gene. Heterologous in this context refers to any nucleotide 
sequence or gene which is not native to the AAV or B 19 parvovirus. 

5 Typically the AAV and B19 coding regions have been deleted, resulting in a safe, 

noncytotoxic vector. The AAV ITRs, or modifications thereof, confer infectivity and site-specific 
integration, but not cytotoxicity, and the promoter directs cell-specific expression. United states 
Patent No. 6,261,834 is herein incorporated by reference for material related to the AAV vector. 

The vectors of the present invention thus provide DNA molecules which are capable of 
10 integration into a mammalian chromosome without substantial toxicity. 

The inserted genes in viral and retroviral usually contain promoters, and/or enhancers to 
help control the expression of the desired gene product. A promoter is generally a sequence or 
sequences of DNA that function when in a relatively fixed location in regard to the transcription 
start site. A promoter contains core elements required for basic interaction of RNA polymerase 
15 and transcription factors, and may contain upstream elements and response elements. 

(d) Large pavload viral vectors 
Molecular genetic experiments with large human herpesviruses have provided a means 
whereby large heterologous DNA fragments can be cloned, propagated and established in cells 
permissive for infection with herpesviruses (Sun et al., Nature genetics 8: 33-41, 1994; Cotter and 

20 Robertson,.Curr Opin Mol Ther 5: 633-644, 1999). These large DNA viruses (herpes simplex 
virus (HSV) and Epstein-Bar* virus (EBV), have the potential to deliver fragments of human 
heterologous DNA > 1 50 kb to specific cells. EBV recombinants can maintain large pieces of 
DNA in the infected B-cells as episomal DNA. Individual clones carried human genomic inserts 
up to 330 kb appeared genetically stable The maintenance of these episomes requires a specific 

25 EBV nuclear protein, EBNA1, constitutively expressed during infection with EBV. Additionally, 
these vectors can be used for transfection, where large amounts of protein can be generated 
transiently in vitro. Herpesvirus amplicon systems are also being used to package pieces of DNA 
> 220 kb and to infect cells that can stably maintain DNA as episomes. 

Other useful systems include, for example, replicating and host-restricted non-replicating 
30 vaccinia virus vectors. 

(2) Non-nucleic acid based systems 
The disclosed compositions can be delivered to the target cells in a variety of ways. For 
example, the compositions can be delivered through electroporalion, or through lipofection, or 
through calcium phosphate precipitation. The delivery mechanism chosen will depend in part on 
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the type of cell targeted and whether the delivery is occurring for example in vivo or in vitro. 

Thus, the compositions can comprise, in addition to the disclosed nucleic acids and 
proteins or vectors for example, lipids such as liposomes, such as cationic liposomes (e.g., 
DOTMA, DOPE, DC-cholesterol) or anionic liposomes. Liposomes can further comprise proteins 
5 to facilitate targeting a particular cell, if desired. Administration of a composition comprising a 
compound and a cationic liposome can be administered to the blood afferent to a target organ or 
inhaled into the respiratory tract to target cells of the respiratory tract. Regarding liposomes, see, 
e.g.,Brighametai.v4w. J. Resp. Cell Mol Biol 1:95-100(1989); Feigner etaLP/w. Natl 
Acad. Sci USA 84:7413-7417(1987); U.S. Pat. No.4,897,355. Furthermore, the compound can 
10 be administered as a component of a microcapsule that can be targeted to specific cell types, such 
as macrophages, or where the diffusion of the compound or delivery of the compound from the 
microcapsule is designed for a specific rate or dosage. 

In the methods described above which include the administration and uptake of exogenous 
DNA into the cells of a subject (i.e., gene transduction or trans fection), delivery of the 

15 compositions to cells can be via a variety of mechanisms. As one example, delivery can be via a 
liposome, using commercially available liposome preparations such as LEPOFECTIN, 
LIPOFECT AMINE (GIBCO-BRL, Inc., Gaithersburg, MD), SUPERFECT (Qiagen, Inc. Hilden, 
Germany) and TRANSFECTAM (Promega Biotec, Inc., Madison, Wl), as well as other liposomes 
developed according to procedures standard in the art. In addition, the nucleic acid or vector of 

20 this invention can be delivered in vivo by electroporation, the technology for which is available 
from Genetronics, Inc. (San Diego, CA) as well as by means of a SONOPORATION machine 
(IrnaRx Pharmaceutical Corp., Tucson, AZ). 

The materials may be in solution or suspension (for example, incorporated into 
microparticles, liposomes, or cells). These may be targeted to a particular cell type via antibodies, 

25 receptors, or receptor ligands. The following references are examples of the use of this technology 
to target specific proteins to tumor tissue (Sentcr, ct al., Bioconiugate Chem. . 2:447-451, (1991); 
Bagshawe, K.D.. Br. J. Cancer. 60:275-281. (1989); Bagshawe, etal., Br. J. Cancer . 58:700-703, 
(1988); Senter, et al., Bioconiugate Chem. . 4:3-9, (1993); Battelli, et al., Cancer Immunol. 
Immunother. . 35:421-425, (1992); Pieterszand McKenzie. Immunolog. Reviews. 129:57-80, 

30 (1992); and Roffler, et al., Biochem. Pharmacol . 42:2062-2065, (1991)). These techniques can be 
used for a variety of other specific cell types. Vehicles, among others, include "stealth" and other 
antibody conjugated liposomes (including lipid mediated drug targeting to colonic carcinoma), 
receptor mediated targeting of DNA through cell specific ligands, lymphocyte directed tumor 
targeting, and highly specific therapeutic retroviral targeting of murine glioma cells in vivo. The 

35 following references are examples of the use of this technology to target specific proteins to tumor 
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tissue (Hughes et al., Cancer Research , 49:6214-6220, (1989); and Litzinger and Huang, 
Biochimica et Biophvsica Acta . 1 104:179-187, (1992)). In general, receptors are involved in 
pathways of endocytosis, either constitutive or ligand induced. These receptors cluster in clathrin- 
coated pits, enter the cell via clathrin-coated vesicles, pass through an acidified endosome in which 
5 the receptors are sorted, and then either recycle to the cell surface, become stored intracellularly, 
or are degraded in lysosomes. The internalization pathways serve a variety of functions, such as 
nutrient uptake, removal of activated proteins, clearance of macromolecules, opportunistic entry of 
viruses and toxins, dissociation and degradation of ligand, and receptor-level regulation. Many 
receptors follow more than one intracellular pathway, depending on the cell type, receptor 
10 concentration, type of ligand, ligand valency, and ligand concentration. Molecular and cellular 
mechanisms of receptor-mediated endocytosis has been reviewed (Brown and Greene, DNA and 
Cell Biology 10:6, 399-409 (1991)). 

Nucleic acids that are delivered to cells which are to be integrated into the host cell 
genome, typically contain integration sequences. These sequences are often viral related 
15 sequences, particularly when viral based systems are used. These viral integration systems can 
also be incorporated into nucleic acids which are to be delivered using a non-nucleic acid based 
system of deliver, such as a liposome, so that the nucleic acid contained in the delivery system can 
be come integrated into the host genome. 

Other general techniques for integration into the host genome include, for example, 
20 systems designed to promote homologous recombination with the host genome. These systems 
typically rely on sequence flanking the nucleic acid to be expressed that has enough homology 
with a target sequence within the host cell genome that recombination between the vector nucleic 
acid and the target nucleic acid takes place, causing the delivered nucleic acid to be integrated into 
the host genome. These systems and the methods necessary to promote homologous 
25 recombination are known to those of skill in the art. 

(3) In vivo/ex vivo 

As described above, the compositions can be administered in a pharmaceutical^ 
acceptable carrier and can be delivered to the subject§s cells in vivo and/or ex vivo by a variety of 
mechanisms well known in the art (e.g., uptake of naked DNA, liposome fusion, intramuscular 
30 injection of DNA via a gene gun, endocytosis and the like). 

If ex vivo methods are employed, cells or tissues can be removed and maintained outside 
the body according to standard protocols well known in the art. The compositions can be 
introduced into the cells via any gene transfer mechanism, such as, for example, calcium 
phosphate mediated gene delivery, electroporation, microinjection or proteoliposomes. The 
35 transduced cells can then be infused (e.g., in a pharmaceutical^ acceptable carrier) or 
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hornotopically transplanted back into the subject per standard methods for the cell or tissue type. 
Standard methods are known for transplantation or infusion of various cells into a subject. 

It is understood that in certain embodiments, constructs which produce an integrin signal 
transduction inhibitor are driven by inducible promoters, rather than constitutive promoters. 
5 inducible systems provide certain advantages, to the expression of the disclosed constructs. Any 
inducible system can be used. Also disclosed are cells containing the inducible systems, described 
herein, and in the Examples. These cells, can be used as model systems in a wide variety of 
assays, as well as in vivo settings. 

q) Expression systems 

10 The nucleic acids that are delivered to cells typically contain expression controlling 

systems. For example, the inserted genes in viral and retroviral systems usually contain promoters, 
and/or enhancers to help control the expression of the desired gene product. A promoter is 
generally a sequence or sequences of DNA that function when in a relatively fixed location in 
regard to the transcription start site. A promoter contains core elements required for basic 

1 5 interaction of RNA polymerase and transcription factors, and may contain upstream elements and 
response elements. 

(1) Viral Promoters and Enhancers 
Preferred promoters controlling transcription from vectors in mammalian host cells may 
be obtained from various sources, for example, the genomes of viruses such as: polyoma, Simian 

20 Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably cytomegalovirus, 
or from heterologous mammalian promoters, e.g. beta actin promoter. The early and late 
promoters of the SV40 virus arc conveniently obtained as an SV40 restriction fragment which also 
contains the SV40 viral origin of replication (Ficrs et al., Nature . 273: 1 13 (1978)). The 
immediate early promoter of the human cytomegalovirus is conveniently obtained as a HindlllE 

25 restriction fragment (Greenway, P.J. et al., Gene 18: 355-360(1982)). Of course, promoters 
from the host cell or related species also are useful herein. 

Enhancer generally refers to a sequence of DNA that functions at no fixed distance from 
the transcription start site and can be either 5* (Laimins, L. et al., Proc. Natl. Acad. Sci, 78: 
993 (1981)) or 3 1 (Lusky, M.L., et al., Mol Cell Bio. 3: 1 108 (1983)) to the transcription unit. 

30 Furthermore, enhancers can be within an intron (Banerji, J.L. et al., Cdl 33: 729 (1983)) as well 
as within the coding sequence itself (Osborne, T.F., et al., Mol. Cell Bio. 4: 1293 (1984)). They 
are usually between 10 and 300 bp in length, and they function in cis. Enhancers function to 
increase transcription from nearby promoters. Enhancers also often contain response elements that 
mediate the regulation of transcription. Promoters can also contain response elements that mediate 

35 the regulation of transcription. Enhancers often determine the regulation of expression of a gene. 
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While many enhancer sequences are now known from mammalian genes (globin, elastase, 
albumin, a- fetoprotein and insulin), typically one will use an enhancer from a eukaryotic cell virus 
for general expression.- Preferred examples are the SV40 enhancer on the late side of the 
replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma 
5 enhancer on the late side of the replication origin, and adenovirus enhancers. 

The promoter and/or enhancer may be specifically activated either by light or specific 
chemical events which trigger their function. Systems can be regulated by reagents such as 
tetracycline and dexamethasone. There are also ways to enhance viral vector gene expression by 
exposure to irradiation, such as gamma irradiation, or alkylating chemotherapy drugs. 

1 0 In certain embodiments the promoter and/or enhancer region can act as a constitutive 

promoter and/or enhancer to maximize expression of the region of the transcription unit to be 
transcribed. In certain constructs the promoter and/or enhancer region be active in all eukaryotic 
cell types, even if it is only expressed in a particular type of cell at a particular time. A preferred 
promoter of this type is the CMV promoter (650 bases). Other preferred promoters are SV40 

1 5 promoters, cytomegalovirus (full length promoter), and retroviral vector LTF. 

It has been shown that all specific regulatory elements can be cloned and used to construct 
expression vectors that are selectively expressed in specific ceil types such as melanoma cells. 
The glial fibrillary acetic protein (GFAP) promoter has been used to selectively express genes in 
cells of glial origin. 

20 Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human 

or nucleated cells) may also contain sequences necessary for the termination of transcription which 
may affect mRNA expression. These regions are transcribed as polyadenylated segments in the 
untranslated portion of the mRNA encoding tissue factor protein. The 3* untranslated regions also 
include transcription termination sites. It is preferred that the transcription unit also contain a 

25 polyadenylation region. One benefit of this region is that it increases the likelihood that the 
transcribed unit will be processed and transported like mRNA. The identification and use of 
polyadenylation signals in expression constructs is well established. It is preferred that 
homologous polyadenylation signals be used in the transgene constructs. In certain transcription 
units, the polyadenylation region is derived from the SV40 early polyadenylation signal and 

30 consists of about 400 bases. It is also preferred that the transcribed units contain other standard 
sequences alone or in combination with the above sequences improve expression from, or stability 
of, the construct. 

(2) Markers 

The viral vectors can include nucleic acid sequence encoding a marker product. This 
35 marker product is used to determine if the gene has been delivered to the cell and once delivered is 

48 



WO 02/30465 



PCT/US01/32127 



being expressed. Preferred marker genes are the E. Coli lacZ gene, which encodes B- 
galactosidase, and green fluorescent protein. 

In some embodiments the marker may be a selectable marker. Examples of suitable 
selectable markers for mammalian cells are dihydro folate reductase (DHFR), thymidine kinase, 
5 neomycin, neomycin analog G418, hydromycin, and puromycin. When such selectable markers 
are successfully transferred into a mammalian host cell, the transformed mammalian host cell can 
survive if placed under selective pressure. There are two widely used distinct categories of 
selective regimes. The first category is based on a cell's metabolism and the use of a mutant cell 
line which lacks the ability to grow independent of a supplemented media. Two examples are: 

1 0 CHO DHFR- cells and mouse LTK- cells. These cells lack the ability to grow without the 
addition of such nutrients as thymidine or hypoxanthine. Because these cells lack certain genes 
necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing 
nucleotides are provided in a supplemented media. An alternative to supplementing the media is 
to introduce an intact DHFR or TK gene into cells lacking the respective genes, thus altering their 

1 5 growth requirements. Individual cells which were not transformed with the DHFR or TK gene 
will not be capable of survival in non-supplemented media. 

The second category is dominant selection which refers to a selection scheme used in any 
cell type and does not require the use of a mutant cell line. These schemes typically use a drug to 
arrest growth of a host cell. Those cells which have a novel gene would express a protein 

20 conveying drug resistance and would survive the selection. Examples of such dominant selection 
use the drugs neomycin, (Southern P. and Berg. P.. J. Molec. Appl. Genet. 1: 327 (1982)), 
mycophenolic acid, (Mulligan, R.C. and Berg, P. Science 209: 1422 (1980)) or hygromycin, 
(Sugden, B. et al., Mol. Cell Biol. 5: 410-413 (1985)). The three examples employ bacterial 
genes under eukaryotic control to convey resistance to the appropriate drug G418 or neomycin 

25 (geneticin), xgpt (mycophenolic acid) or hygromycin, respectively. Others include the neomycin 
analog G418 and puromycin. 

h) Peptides 

(1) Protein variants 

As discussed herein there are numerous variants of the beta4 integrin protein, alpha6 
30 integrin protein, and gamma2 laminin5 protein, for example, that are known and herein 

contemplated. In addition, to the known functional homologue variants there are derivatives of the 
beta4, alpha6, and gamma2, and other disclosed proteins which also function in the disclosed 
methods and compositions. Protein variants and derivatives are well understood to those of skill in 
the art and in can involve amino acid sequence modifications. For example, amino acid sequence 
35 modifications typically fall into one or more of three classes: substitutional, insertional or 
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deletional variants. Insertions include amino and/or carboxyl terminal fusions as well as 
intrasequence insertions of single or multiple amino acid residues. Insertions ordinarily will be 
smaller insertions tharr those of amino or carboxyl terminal fusions, for example, on the order of 
one to four residues. Immunogenic fusion protein derivatives are made by fusing a polypeptide 

5 sufficiently large to confer immunogenicity to the target sequence by cross-linking in vitro or by 
recombinant cell culture transformed with DNA encoding the fusion. Deletions are characterized 
by the removal of one or more amino acid residues from the protein sequence. Typically, no more 
than about from 2 to 6 residues are deleted at any one site within the protein molecule. These 
variants ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA encoding 

10 the protein, thereby producing DNA encoding the variant, and thereafter expressing the DNA in 
recombinant cell culture. Techniques for making substitution mutations at predetermined sites in 
DNA having a known sequence arc well known, for example Ml 3 primer mutagenesis and PCR 
mutagenesis. Amino acid substitutions are typically of single residues, but can occur at a number 
of different locations at once; insertions usually will be on the order of about from 1 to 10 amino 

15 acid residues; and deletions will range about from 1 to 30 residues. Deletions or insertions 
preferably are made in adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues. 
Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a final 
construct. The mutations must not place the sequence out of reading frame and preferably will not 
create complementary regions that could produce secondary mRNA structure. Substitutional 

20 variants are those in which at least one residue has been removed and a different residue inserted 
in its place. Such substitutions generally are made in accordance with the following Tables i and 
2 and are referred to as conservative substitutions. 

TABLE 1 : Amino Acid Abbreviations 



Amino Acid 


Abbreviations 




25 alanine 


Ala 


A 


ailosoleucine 


Alle 




arginine 


Arg 


R 


asparagine 


Asn 


N 


aspartio acid 


Asp 


D 


30 cysteine 


Cys 


C 


glutamic acid 


Glu 


E 


glutamine 


Gin 


K 


glycine 


Gly 


G 


histidine 


His 


H 


35 isolelucine 


lie 


1 


leucine 


Leu 


L 


lysine 


Lys 


K 


phenylalanine 


Phe 


F 


proline 


Pro 


P 


40 pyroglutamic acid 


pGlu 




serine 


Ser 


S 


threonine 


Thr 


T 
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tyrosine Tyr Y 

tryptophan Trp W 

valine Val V 



5 TABLE 2: Amino Acid Substitutions 

Original Residue Exemplary Conservative Substitutions, others are known in the art. 

Ala ser 

Arg lys, gin 

Asn gin; his 

10 Asp glu 

Cys ser 

Gin asn, lys 

Glu asp 

Gly pro 

15 His asn;gln 

He leu; val 

Leu ile; val 

Lys arg; gin; 

Met Leu; ile 

20 Phe met; leu; tyr 

Ser thr 

Thr ser 

Trp tyr 

Tyr trp; phe 

25 Val ile; leu 



Substantial changes in function or immunological identity are made by selecting 
substitutions that are less conservative than those in Table 2, i.e., selecting residues that differ 
more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in 
the area of the substitution, for example as a sheet or helical conformation, (b) the charge or 

30 hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The substitutions 
which in general are expected to produce the greatest changes in the protein properties will be 
those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a 
hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline 
is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., 

35 lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., glutamyl or 
aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) 
one not having a side chain, e.g., glycine, in this case, (e) by increasing the number of sites for 
sulfation and/or glycosylation. 

For example, the replacement of one amino acid residue with another that is biologically 
40 and/or chemically similar is known to those skilled in the art as a conservative substitution. For 
example, a conservative substitution would be replacing one hydrophobic residue for another, or 
one polar residue for another. The substitutions include combinations such as, for example, Gly, 
Ala; Val, Ile, Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. Such conservatively 
substituted variations of each explicitly disclosed sequence are included within the mosaic 
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polypeptides provided herein. 

Substitutional or deletional mutagenesis can be employed to insert sites for N- 
glycosylation (Asn-X-thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other 
labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, e.g. 
5 Arg, is accomplished for example by deleting one of the basic residues or substituting one by 
glutaminyl or histidyl residues. 

Certain post-translational derivatizations are the result of the action of recombinant host 
cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently post- 
- translationally deamidated to the corresponding glutamyl and asparyl residues. Alternatively, 
10 these residues are deamidated under mildly acidic conditions. Other post-translational 

modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of 
seryl or threonyl residues, methylation of the o-amino groups of lysine, arginine, and histidine side 
chains (T.E. Creighton, Proteins: Structure and Molecular Properties, W. H. Freeman & Co., 
San Francisco pp 79-86 [1983]), acetylation of the N-terminal amine and, in some instances, 
1 5 amidation of the C-terminal carboxyl. 

It is understood that one way to define the variants and derivatives of the disclosed 
proteins herein is through defining the variants and derivatives in terms of homology/identity to 
specific known sequences. For example, SEQ ID NO:SEQ ID NO:5 sets forth a particular 
sequence of beta4 integrin cDNA and SEQ ID NO: 6 sets forth a particular sequence of a beta4 
20 integrin protein. Specifically disclosed are variants of these and other proteins herein disclosed 
which have at least, 70% or 75% or 80% or 85% or 90% or 95% homology to the stated sequence. 
Those of skill in the art readily understand how to determine the homology of two proteins. For 
example, the homology can be calculated after aligning the two sequences so that the homology is 
at its highest level. 

25 Another way of calculating homology can be performed by published algorithms. Optimal 

alignment of sequences for comparison may be conducted by the local homology algorithm of 
Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of 
Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of 
Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized 

30 implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by 
inspection. 

The same types of homology can be obtained for nucleic acids by for example the 
algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger etal. Proc. Natl Acad. ScL 
35 USA 86:7706-7710, 1989, Jaeger etal. Methods EnzymoL 183:281-306, 1989 which are herein 
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incorporated by reference for at least material related to nucleic acid alignment. 

It is understood that the description of conservative mutations and homology can be 
combined together in any combination, such as embodiments that have at least 70% homology to a 
particular sequence wherein the variants are conservative mutations. 

5 As this specification discusses various proteins and protein sequences it is understood that 

the nucleic acids that can encode those protein sequences are also disclosed. This would include 
all degenerate sequences related to a specific protein sequence, i.e. all nucleic acids having a 
sequence that encodes one particular protein sequence as well as all nucleic acids, including 
degenerate nucleic acids, encoding the disclosed variants and derivatives of the protein sequences. 

10 Thus, while each particular nucleic acid sequence may not be written out herein, it is understood 
that each and every sequence is in fact disclosed and described herein through the disclosed 
protein sequence. For example, one of the many nucleic acid sequences that can encode the 
protein sequence set forth in SEQ ID NO:6 is set forth in SEQ ID NO:5. Another nucleic acid 
sequence that encodes the same protein sequence set forth in SEQ ID NO:6 is set forth in SEQ ID 

15 NO: 16. In addition, for example, a disclosed conservative derivative of SEQ ID NO:2 is shown in 
SEQ ID NO: 17, where the valine (V) at position 34 is changed to a isoleucine (I). It is understood 
that for this mutation all of the nucleic acid sequences that encode this particular derivative of the 
beta4 integrin are also disclosed including for example SEQ ID NO: 18 and SEQ ID NO: 19 which 
set forth two of the degenerate nucleic acid sequences that encode the particular polypeptide set 

20 forth in SEQ ID NO: 1 7. It is also understood that while no amino acid sequence indicates what 
particular DNA sequence encodes that protein within an organism, where particular variants of a 
disclosed protein are disclosed herein, the known nucleic acid sequence that encodes that protein 
in the particular organism from which that protein arises is also known and herein disclosed and 
described. 

25 i) Pharm aceutical carriers/Delivery of pharmaceutical products 

As described above, the compositions, for example, compositions that inhibit alpha6 
function, beta4 function, or gamma2 function can also be administered in vivo in a 
pharmaceutical^ acceptable carrier. By "pharmaceutically acceptable 0 is meant a material that is 
not biologically or otherwise undesirable, i.e., the material may be administered to a subject, along 

30 with the nucleic acid or vector, without causing any undesirable biological effects or interacting in 
a deleterious manner with any of the other components of the pharmaceutical composition in 
which it is contained. The carrier would naturally be selected to minimize any degradation of the 
active ingredient and to minimize any adverse side effects in the subject, as would be welt known 
to one of skill in the art. 

35 The compositions may be administered orally, parenterally (e.g., intravenously), by 
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intramuscular injection, by intraperitoneal injection, transdermal^, extracorporeal^, topically or 
the like, although topical intranasal administration or administration by inhalant is typically 
preferred. As used herein, "topical intranasal administration" means delivery of the compositions 
into the nose and nasal passages through one or both of the nares and can comprise delivery by a 
5 spraying mechanism or droplet mechanism, or through aerosolization of the nucleic acid or vector. 
The latter may be effective when a large number of animals is to be treated simultaneously. 
Administration of the compositions by inhalant can be through the nose or mouth via delivery by a 
spraying or droplet mechanism. Delivery can also be directly to any area of the respiratory system 
(e.g., lungs) via intubation. The exact amount of the compositions required will vary from subject 
10 to subject, depending on the species, age, weight and general condition of the subject, the seventy 
of the allergic disorder being treated, the particular nucleic acid or vector used, its mode of 
administration and the like. Thus, it is not possible to specify an exact amount for every 
composition. However, an appropriate amount can be determined by one of ordinary skill in the 
art using only routine experimentation given the teachings herein. 

15 Parenteral administration of the composition, if used, is generally characterized by 

injection. Injectables can be prepared in conventional forms, either as liquid solutions or 
suspensions, solid forms suitable for solution of suspension in liquid prior to injection, or as 
emulsions. A more recently revised approach for parenteral administration involves use of a slow 
release or sustained release system such that a constant dosage is maintained. See, e.g., U.S. 

20 Patent No. 3,610,795, which is incorporated by reference herein. 

The materials may be in solution, suspension (for example, incorporated into 
rnicroparticles, liposomes, or cells). These may be targeted to a particular cell type via antibodies, 
receptors, or receptor ligands. The following references are examples of the use of this technology 
to target specific proteins to tumor tissue (Senter, et al., Bioconiugate Chem.. 2:447-451 , (1991); 

25 Bagshawe, K.D., Br. J. Cancer . 60:275-281, (1989); Bagshawe, et ah, Br. J. Cancer, 58:700-703, 
(1988); Senter, et al., Bioconiugate Chem.. 4:3-9, (1993); Battelli, et al., Cancer Immunol. 
Immunother.. 35:421-425, (1992); Pietersz and McKenzie. Immunolog. Reviews . 129:57-80, 
(1992); and Roffler, et al, Biochem. Pharmacol . 42:2062-2065, (1991)). Vehicles such as 
"stealth" and other antibody conjugated liposomes (including lipid mediated drug targeting to 

30 colonic carcinoma), receptor mediated targeting of DNA through cell specific ligands, lymphocyte 
directed tumor targeting, and highly specific therapeutic retroviral targeting of murine glioma cells 
in vivo. The following references are examples of the use of this technology to target specific 
proteins to tumor tissue (Hughes et al., Cancer Research . 49:6214-6220, (1989); and Litzinger and 
Huang. Biochimica et Biophvsica Acta. 1104:179-187,(1992)). In general, receptors are involved 

35 in pathways of endocytosis, either constitutive or ligand induced. These receptors cluster in 

clathrin-coated pits, enter the cell via clathrin-coated vesicles, pass through an acidified endosome 
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in which the receptors are sorted, and then either recycle to the cell surface, become stored 
intracellularly, or are degraded in lysosomes. The internalization pathways serve a variety of 
functions, such as nutrient uptake, removal of activated proteins, clearance of macromolecules, 
opportunistic entry of viruses and toxins, dissociation and degradation of ligand, and receptor-level 
5 regulation. Many receptors follow more than one intracellular pathway, depending on the cell 
type, receptor concentration, type of ligand, ligand valency, and ligand concentration. Molecular 
and cellular mechanisms of receptor-mediated endocytosis has been reviewed (Brown and Greene, 
DNA and Cell Biology 10:6,399-409(1991)). 

(1) Pharmaceutical^ Acceptable Carriers 
10 The compositions, including antibodies, can be used therapeutically in combination with a 

pharmaceutical^ acceptable carrier. 

Pharmaceutical carriers are known to those skilled in the art. These most typically would 
be standard carriers for administration of drugs to humans, including solutions such as sterile 
water, saline, and buffered solutions at physiological pH. The compositions can be administered 
15 intramuscularly or subcutaneously. Other compounds will be administered according to standard 
procedures used by those skilled in the art. 

Pharmaceutical compositions may include carriers, thickeners, diluents, buffers, 
preservatives, surface active agents and the like in addition to the molecule of choice. 
Pharmaceutical compositions may also include one or more active ingredients such as antimicrobial 
20 agents, anti-inflammatory agents, anesthetics, and the like. 

The pharmaceutical composition may be administered in a number of ways depending on 
whether local or systemic treatment is desired, and on the area to be treated. Administration may be 
topically (including ophthalmically, vaginally, rectally, intranasally), orally, by inhalation, or 
parenterally, for example by intravenous drip, subcutaneous, intraperitoneal or intramuscular 
25 injection. The disclosed antibodies can be administered intravenously, intraperitoneally, 
intramuscularly, subcutaneously, intracavity, or transdermal ly. 

Preparations for parenteral administration include sterile aqueous or non-aqueous 
solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
30 oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, 
including saline and buffered media. Parenteral vehicles include sodium chloride solution, 
Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous 
vehicles include fluid and nutrient replenishes, electrolyte replenishes (such as those based on 
Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, for 
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example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

Formulations for topical administration may include ointments, lotions, creams, gels, drops, 
suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder 
or oily bases, thickeners and the like may be necessary or desirable. 

5 Compositions for oral administration include powders or granules, suspensions or solutions 

in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings, diluents, 
ernulsifiers, dispersing aids or binders may be desirable. 

Some of the compositions may potentially be administered as a pharmaceutically 
acceptable acid- or base- addition salt, formed by reaction with inorganic acids such as 

10 hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, and 
phosphoric acid, and organic acids such as formic acid, acetic acid, propionic acid, glycolic acid, 
lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fumaric acid, or 
by reaction with an inorganic base such as sodium hydroxide, ammonium hydroxide, potassium 
hydroxide, and organic bases such as mono-, di-, trialkyl and aryl amines and substituted 

15 ethanolamines. 

(2) Therapeutic Uses 
The dosage ranges for the administration of the compositions are those large enough to 
produce the desired effect in which the symptoms disorder are effected. The dosage should not be 
so large as to cause adverse side effects, such as unwanted cross-reactions, anaphylactic reactions, 
20 and the like. Generally, the dosage will vary with the age, condition, sex and extent of the disease 
in the patient and can be determined by one of skill in the art. The dosage can be adjusted by the 
individual physician in the event of any counterindications. Dosage can vary, and can be 
administered in one or more dose administrations daily, for one or several days. 

Other compositions which do not have a specific pharmaceutical function, but which may 
25 be used for tracking changes within cellular chromosomes or for the delivery of diagnostic tools 
for example can be delivered in ways similar to those described for the pharmaceutical products. 

\) Chips and micro arrays 
Disclosed are chips where at least one address is the sequences or part of the sequences set 
forth in any of the nucleic acid sequences disclosed herein. Also disclosed are chips where at least 
30 one address is the sequences or portion of sequences set forth in any of the peptide sequences 
disclosed herein. 

Also disclosed are chips where at least one address is a variant of the sequences or part of 
the sequences set forth in any of the nucleic acid sequences disclosed herein. Also disclosed are 
chips where at least one address is a variant of the sequences or portion of sequences set forth in 
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any of the peptide sequences disclosed herein. 

k) Computer readable mediums 
It is understood that the disclosed nucleic acids and proteins can be represented as a 
sequence consisting of the nucleotides of amino acids. There are a variety of ways to display these 
5 sequences, for example the nucleotide guanosine can be represented by G or g. Likewise the 
amino acid valine can be represented by Val or V. Those of skill in the art understand how to 
display and express any nucleic acid or protein sequence in any of the variety of ways that exist, 
each of which is considered herein disclosed. Specifically contemplated herein is the display of 
these sequences on computer readable mediums, such as, commercially available floppy disks, 
10 tapes, chips, hard drives, compact disks, and video disks, or other computer readable mediums. 
Also disclosed are the binary code representations of the disclosed sequences. Those of skill in the 
art understand what computer readable mediums. Thus, computer readable mediums on which the 
nucleic acids or protein sequences are recorded, stored, or saved. 

Disclosed arc computer readable mediums comprising the sequences and information regarding 
15 the sequences set forth herein. Also disclosed are computer readable mediums comprising the 
sequences and information regarding the sequences set forth herein wherein the sequences do not 
include SEQ ID NOs: SEQ lDNOs:l-19. 

I) Kits 

Disclosed herein are kits that are drawn to reagents that can be used in practicing the 
20 methods disclosed herein* The kits can include any reagent or combination of reagent discussed 
herein or that would be understood to be required or beneficial in the practice of the disclosed 
methods. For example, the kits could include primers to perform the amplification reactions 
discussed in certain embodiments of the methods, as well as the buffers and enzymes required to 
use the primers as intended. For example, disclosed is a kit for assessing a subject's risk for 
25 acquiring cancer, such as colon cancer, comprising the primers or probes that hybridize to the 
sequences set forth in SEQ ID NOs: 1, 3, 5, 7, 9, 1 1, and 13, for example. 

m) Compositions with similar functions 
It is understood that the compositions disclosed herein have certain functions, such as 
inhibiting gamma2 function or binding alpha6 integrin or inhibiting beta4 function. Disclosed 
30 herein are certain structural requirements for performing the disclosed functions, and it is 

understood that there are a variety of structures which can perform the same function which are 
related to the disclosed structures, and that these structures will ultimately achieve the same result, 
for example stimulation or inhibition alpha6beta4 signaling or interruption of the alpha6beta4 
signaling pathway. 
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2. Methods of making the compositions 

The compositions disclosed herein and the compositions necessary to perform the 
disclosed methods can- be made using any method known to those of skill in the art for that 
particular reagent or compound unless otherwise specifically noted. 

5 a) Nucleic acid synthesis 

For example, the nucleic acids, such as, the oligonucleotides to be used as primers can be 
made using standard chemical synthesis methods or can be produced using enzymatic methods or 
any other known method. Such methods can range from standard enzymatic digestion followed by 
nucleotide fragment isolation (see for example, Sambrook et aL t Molecular Cloning: A Laboratory 

10 Manual, 2nd Edition (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989) 
Chapters 5, 6) to purely synthetic methods, for example, by the cyanoethyl phosphoramidite 
method using a Milligen or Beckman System lPlus DNA synthesizer (for example, Model 8700 
automated synthesizer of Milligen-Biosearch, Burlington, MA or ABI Model 380B). Synthetic 
methods useful for making oligonucleotides are also described by Ikuta et al.,Ann. Rev. Biochem. 

15 53:323-356 (1984), (phosphotriester and phosphite- triester methods), and Narang et al % Methods 
EnzymoL, 65:610-620 (1980), (phosphotriester method). Protein nucleic acid molecules can be 
made using known methods such as those described by Nielsen et al., Bioconjug. Chem, 5:3-7 
(1994). 

b) Peptide synthesis 

20 One method of producing the disclosed proteins, or fragments of the disclosed proteins, 

such as a fragment of SEQ [D NO:6, is to link two or more peptides or polypeptides together by 
protein chemistry techniques.' For example, peptides or polypeptides can be chemically 
synthesized using currently available laboratory equipment using either Fmoc (9- 
fluorenylmethyloxycarbonyl) or Boc (tert -butyloxycarbonoyl) chemistry. (Applied Biosystems, 

25 Inc., Foster City, CA). One skilled in the art can readily appreciate that a peptide or polypeptide 
corresponding to the disclosed proteins, for example, can be synthesized by standard chemical 
reactions. For example, a peptide or polypeptide can be synthesized and not cleaved from its 
synthesis resin whereas the other fragment of a peptide or protein can be synthesized and 
subsequently cleaved from the resin, thereby exposing a terminal group which is functionally 

30 blocked on the other fragment. By peptide condensation reactions, these two fragments can be 
co valently joined via a peptide bond at their carboxyl and amino termini, respectively, to form an 
antibody, or fragment thereof. (Grant OA (1992) Synthetic Peptides: A User Guide. W.H. 
Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed. (1993) Principles of Peptide 
Synthesis, Springer- Verlag Inc., NY (which is herein incorporated by reference at least for 

35 material related to peptide synthesis). Alternatively, the peptide or polypeptide is independently 
synthesized /;; vivo as described herein. Once isolated, these independent peptides or polypeptides 
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may be linked to form a peptide or fragment thereof via similar peptide condensation reactions. 

For example, enzymatic ligation of cloned or synthetic peptide segments allow relatively 
short peptide fragments to be joined to produce larger peptide fragments, polypeptides or whole 
protein domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively, native 
5 chemical ligation of synthetic peptides can be utilized to synthetically construct large peptides or 
polypeptides from shorter peptide fragments. This method consists of a two step chemical reaction 
(Dawson et al. Synthesis of Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)). 
The first step is the chemoselective reaction of an unprotected synthetic peptide-thioester with 
another unprotected peptide segment containing an arnino-terminal Cys residue to give a thioester- 
10 linked intermediate as the initial covalent product. Without a change in the reaction conditions, 
this intermediate undergoes spontaneous, rapid intramolecular reaction to form a native peptide 
bond atthe ligation site (Baggiolini M etal. (1992) FEBS Lett. 307:97-101; Clark-Lewis I et al., 
JJBiol.Chem., 269:16075 (1994); Clark-Lewis I etal. Biochemistry, 30:3128 (1991); Rajarathnam 
K et al. Biochemistry 33:6623-30 (1994)). 

15 Alternatively, unprotected peptide segments are chemically linked where the bond formed 

between the peptide segments as a result of the chemical ligation is an unnatural (non-peptide) 
bond (Schnolzer, Met al. Science, 256:221 (1992)). This technique has been used to synthesize 
analogs of protein domains as well as large amounts of relatively pure proteins with full biological 
activity (deLisle Milton RC et al. Techniques in Protein Chemistry IV. Academic Press, New 

20 York, pp. 257-267(1992)). 

The disclosed proteins and polypeptides such as that for SEQ ID NO:6, beta4 integrin, can be 
made using any traditional recombinant biotechnology method. Examples of such methods can be 
found in Sambrook et al. which is herein incorporated by reference at least for material related to 
production of proteins and antibodies. 

25 c) Processes for making the compositions 

Disclosed are processes for making the compositions as well as making the intermediates 
leading to the compositions. For example, disclosed are nucleic acids in SEQ TD NOs: 1, 3, 5, 7, 9, 
1 1, and 13. There are a variety of methods that can be used for making these compositions, such 
as synthetic chemical methods and standard molecular biology methods. It is understood that the 

30 methods of making these and the other disclosed compositions are specifically disclosed. 

Disclosed are nucleic acid molecules produced by the process comprising linking in an 
operative way a nucleic acid molecule comprising the sequence set forth in SEQ ID Nos: 1 , 3, 5, 7, 
9, 1 1, or 13 or a fragment thereof, and a sequence controlling the expression of the nucleic acid. 

Also disclosed are nucleic acid molecules produced by the process comprising linking in 
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an operative way a nucleic acid molecule comprising a sequence having at least 80% identity to a 
sequence set forth in SEQ IDNos:l, 3, 5, 7, 9, 1 1, or 13 or a fragment thereof, and a sequence 
controlling the expression of the nucleic acid. 

Disclosed are nucleic acid molecules produced by the process comprising linking in an 
5 operative way a nucleic acid molecule comprising a sequence that hybridizes under stringent 
hybridization conditions to a nucleic acid molecule comprising the sequence set forth in SEQ ID 
Nos:l, 3, 5, 7, 9, 1 1, or 13 or a fragment thereof, and a sequence controlling the expression of the 
nucleic acid. 

Disclosed are nucleic acid molecules produced by the process comprising linking in an 
10 operative way a nucleic acid molecule comprising a sequence encoding a peptide set forth in SEQ 
ID Nos:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof, and a sequence controlling an expression of 
the nucleic acid molecule. 

Disclosed are nucleic acid molecules produced by the process comprising linking in an 
operative way a nucleic acid molecule comprising a sequence encoding a peptide having 80% 
15 identity to a peptide set forth in SEQ ID Nos:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof and a 
sequence controlling an expression of the nucleic acid molecule. 

Disclosed are nucleic acids produced by the process comprising linking in an operative 
way a nucleic acid molecule comprising a sequence encoding a peptide having 80% identity to a 
peptide set forth in SEQ ID Nos:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof, wherein any change 
20 from the sequences set forth in SEQ ID Nos:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof are 
conservative changes and a sequence controlling an expression of the nucleic acid molecule. 

Disclosed are cells produced by the process of transforming the cell with any of the 
disclosed nucleic acids. Disclosed are cells produced by the process of transforming the cell with 
any of the non-naturally occurring disclosed nucleic acids. 

25 Disclosed are any of the disclosed peptides produced by the process of expressing any of 

the disclosed nucleic acids. Disclosed are any of the non-naturally occurring disclosed peptides 
produced by the process of expressing any of the disclosed nucleic acids. Disclosed are any of the 
disclosed peptides produced by the process of expressing any of the non-naturally disclosed 
nucleic acids. 

30 Disclosed are animals produced by the process of transfecting a cell within the animal with 

any of the nucleic acid molecules disclosed herein. Disclosed are animals produced by the process 
of transfecting a ceil within the animal any of the nucleic acid molecules disclosed herein, wherein 
the animal is a mammal. Also disclosed are animals produced by the process of transfecting a cell 
within the animal any of the nucleic acid molecules disclosed herein, wherein the mammal is 
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mouse, rat, rabbit, cow, sheep, pig, or primate. 

Also disclose are animals produced by the process of adding to the animal any of the cells 
disclosed herein. 

d) Products produced from selection protocols 
5 Also disclosed are methods of obtaining molecules that act as functional regulators of 

integrin function, integrin receptor function, and functional regulators of signaling pathways 
related to integrin receptors, in particular integrin alpha6beta4. 

Disclosed are methods for isolating molecules that interact with the proteins set forth in 
SEQ ID Nos:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof comprising, interacting a library of 
10 molecules with the proteins set forth in SEQ ID Nos;2, 4, 6, 8, 10, 12, or 14 or a fragment thereof, 
removing the unbound molecules, and collecting the molecules that are bound to at least one of the 
proteins set forth in SEQ ID Nos:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof. 

3. Methods of using the compositions 

a) Methods of using the compositions as research tools 

15 The compositions can be used for example as targets in combinatorial chemistry protocols 

or other screening protocols to isolate molecules that possess desired functional properties related 
to A6B4 and A6B1 signaling pathways. 

The disclosed compositions can also be used diagnostic tools related to diseases such as 
cancer, such as colon cancer. 

20 The disclosed compositions can be used as discussed herein as either reagents in micro 

arrays or as reagents to probe or analyze existing microarrays. The disclosed compositions can be 
used in any known method for isolating or identifying single nucleotide polymorphisms. The 
compositions can also be used in any method for determining allelic analysis of for example, beta4 
alleles having varying function, particularly allelic analysis as it relates to beta4 signaling and 

25 functions. The compositions can also be used in any known method of screening assays, related to 
chip/micro arrays. The compositions can also be used in any known way of using the computer 
readable embodiments of the disclosed compositions, for example, to study relatedness or to 
perform molecular modeling analysis related to the disclosed compositions. 

b) Methods for affecting cancer 

30 The disclosed compositions can be used to affect the growth of cancer cells because as 

disclosed herein, the disclosed relationships are fundamental to the ability of cancer cells to 
continue growing. The disclosed compositions, such as antisense constructs that will inhibit the 
production of either alpha6, beta4, or the gamma2 chain of laminin5 reduce the proliferation of 
cancer cells. In fact, the compositions cannot only reduce the proliferation of the cancer cells, but 



61 



WO 02/30465 



PCT/US01/32127 



the compositions can kill the cancer cells, as shown herein. 

It is understood that cancer is caused by a variety of cellular events, of which certain 
events related to alpha6 intcgrin (up regulated), beta4 integrin (upregulated) and gamma2 
(upregulated) allow the continued viability of cancer cells, and that interference of these 
5 upregulated molecules inhibits the growth and kills the cancer cells. However, there are other 
known events that can cause non-cancerous cells to become oncogenic. For example, Abl, Ras, 
EGF receptor, ErB-2, APC, beta-catenin, Arf, Mdm2, p53, Rb, Myc are known to be involved in 
oncogenesis, and some of these molecules (For exmaple, Ras, APC loss, p53 loss) are directly 
related to the disclosed target signal transduction pathways mediated by the alpha6beta4 receptor. 
10 Just as the presently disclosed compositions can be used as therapeutics targeting the disclosed 
relationships, so to there are other targets (For example, Abl, ErB-2) for which pharmaceutical 
compositions have been developed (For example, Glivec, Herceptin, respectively). 

It is understood that the disclosed anti-cancer compositions can be used in combination 
with other anti-cancer compositions. Great benefits can be obtained from using anti-cancer 
15 compositions that target different molecules, in the same signal transduction pathway as well as, or 
in addition to, targeting molecules in different signal transduction pathways than those disclosed 
herein. Thus, the disclosed compositions which can affect the growth of cancer cells, indeed kill 
cancer cells, can be used in conjunction with any other chemotherapy, radiation, or any other anti- 
cancer therapy. 

20 Disclosed are methods of reducing the proliferation of a cancer cell which comprises 

inhibiting ligand binding to an integrin receptor on the cancer cell, wherein the integrin receptor 
comprises an integrin. 

Also disclosed are methods of reducing the proliferation of a cancer cell which comprises 
reducing integrin-integrin interaction, integrin receptor clustering interaction, or integrin-non- 
25 integrin protein interaction. 

Also disclosed are methods of selectively reducing the proliferation of cancer cells which 
comprises reducing integrins from interacting with one another, integrins from clustering, or 
integrins from interacting with other proteins associated with cancer cells. 

Disclosed are methods of reducing the proliferation of a cancer cell which comprises 
30 reducing the production of an integrin by the cancer cell. 

Also disclosed are methods of reducing the proliferation of a cancer cell which comprises 
reducing the production of an integrin receptor ligand by the cancer cell. 

Disclosed are methods of reducing the proliferation of a cancer cell which comprises 
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interfering with an integrin signaling pathway. 

Also disclosed are methods of selectively killing or reducing the proliferation of cancer 
cells which comprises inhibiting ligand binding to integrin receptors on the cancer cells, wherein 
the integrin receptor comprises a B4 integrin. 

5 Disclosed are methods of selectively killing or reducing the proliferation of cancer cells 

which comprises inhibiting ligand binding to integrin receptors on the cancer cells, wherein the 
ligand comprises a lamininS. 

Also disclosed are methods of selectively killing or reducing the proliferation of cancer 
cells which comprises inhibiting ligand binding to integrin receptors on the cancer cells, wherein 
1 0 the ligand comprises the gamma2 subunit of the laminin L5. 

Disclosed are methods of selectively killing or reducing the proliferation of cancer cells 
which comprises inhibiting ligand binding to integrin receptors on the cancer cells, wherein the 
integrin receptor comprises an alpha6 integrin. 

Further disclosed are methods of selectively killing or reducing the proliferation of cancer 
1 5 cells which comprises preventing integrin receptor subunits from interacting with one another, 
preventing integrin clustering, or preventing integrin receptor subunits from interacting with other 
proteins on cancer cells, wherein the integrin receptor comprises a B4 integrin. 

Disclosed are methods of selectively killing or reducing the proliferation of cancer cells 
which comprises reducing the production of laminin by the cancer cells, wherein the laminin 
20 comprises lamininS, and/or any of the subunits of laminin5, such as gamma2. 

Also disclosed is a method of selectively killing or reducing the proliferation of cancer 
cells which comprises interfering with an integrin signaling pathway, wherein the integrin 
signaling pathway comprises a B4 integrin or an alpha6 integrin or a betal integrin or a laminin or 
a lamininS or the garnma2 subunit of lamininS. 

25 Also dislcosed are methods, wherein the integrin receptor comprises integrin B4 and/or 

wherein the integrin receptor comprises integrin A6, and/or wherein the ligand that binds to the 
integrin receptor is lamininS, and/or wherein the integrin receptor is A6B4, and/or wherein the 
ligand comprises lamininS, and/or wherein the ligand comprises the gamma-2 subunit of lamininS, 
and/or wherein inhibiting ligand binding to an integrin receptor does not occur by using an 

30 antisense molecule to A6, and/or wherein inhibiting ligand binding to an integrin receptor 

comprises contacting a A6 integrin with a composition that inhibits ligand binding, and/or wherein 
inhibiting ligand binding to an integrin receptor comprises contacting a B4 integrin with a 
composition that inhibits ligand binding, and/or wherein inhibiting ligand binding to an integrin 
receptor comprises contacting a lamininS with a composition that inhibits ligand binding, and/or 
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wherein inhibiting ligand binding to an integrin receptor comprises contacting a gamma-2 subunit 
with a composition that inhibits ligand binding, and/or wherein reducing integrin-integrin 
interaction, integrin receptor clustering interaction, or integrin-non-integrin protein interaction 
does, not occur by using a B4-delta-cyt, and/or wherein reducing integrin-integrin interaction, 
5 integrin receptor clustering interaction, or integrin-non-integrin protein interaction does not occur 
by using an antisense molecule to A6, and/or wherein reducing integrin-integrin interaction, 
integrin receptor clustering interaction, or integrin-non-integrin protein interaction comprises 
contacting a A6 integrin with a composition that inhibits an interaction between the B4 integrin 
and another integrin or protein molecule, and/or wherein reducing integrin-integrin interaction, 

10 integrin receptor clustering interaction, or integrin-non-integrin protein interaction comprises 
contacting a B4 integrin with a composition that inhibits the interaction between the B4 integrin 
and another integrin or protein molecule, and/or wherein reducing integrin-integrin interaction, 
integrin receptor clustering interaction, or integrin-non-integrin protein interaction comprises 
contacting a lamininS with a composition that inhibits ligand binding, and/or wherein reducing 

1 5 integrin-integrin interaction, integrin receptor clustering interaction, or integrin-non-integrin 

protein interaction comprises contacting a gamma2 subunit with a composition that inhibits ligand 
binding, and/or wherein the non-integrin protein comprises a growth factor receptor, and/or 
wherein the non-integrin protein comprises a hemi-desomosome junction, and/or wherein the non- 
integrin protein comprises a SH2 domain, and/or wherein the non-integrin protein comprises a She 

20 protein, and/or wherein the non-integrin protein comprises a IRS-1 protein, and/or wherein the 
non-integrin protein comprises a IRS-2 protein, and/or wherein reducing the production of an 
integrin does not occur by using an antisense molecule to A6, and/or wherein the production of an 
integrin is reduced by inhibiting signaling leading to induction of expression of an integrin, and/or 
wherein reducing the production of an integrin comprises inhibiting alpha6 production, and/or 

25 wherein inhibiting alpha6 production further comprises using antisense molecules to alpha6 
mRNA, and/or wherein reducing the production of an integrin comprises inhibiting beta4 
production, and/or wherein inhibiting beta4 production further comprises using antisense 
molecules to beta4 mRNA, and/or wherein reducing the production of an integrin receptor ligand 
comprises inhibiting gamrna2 production, and/or wherein reducing the production of an integrin 

30 receptor ligand comprises inhibiting laminin production, and/or wherein reducing the production 
of an integrin receptor ligand comprises inhibiting laminin5 production, and/or wherein interfering 
with an integrin signaling pathway does not occur by using a B4-delta-cyt, and/or wherein 
interfering with an integrin signaling pathway does not occur by using an antisense molecule to 
A6, and/or wherein interfering with an integrin signaling pathway comprises contacting an A6 

35 integrin in the cell with a composition that inhibits ligand binding, and/or wherein interfering with 
an integrin signaling pathway comprises contacting a B4 integrin with a composition that inhibits 



64 



WO 02/30465 



PCT/USO 1/32127 



ligand binding, and/or wherein interfering with an integrin signaling pathway comprises contacting 
a laminin5 with a composition that inhibits ligand binding, and/or wherein interfering with an 
integrin signaling pathway comprises contacting a gamma2 subunit with a composition that 
inhibits ligand binding, and/or wherein interfering with an integrin signaling pathway comprises 
5 contacting the cancer cell with a molecule that interferes with at least one of talin, paxillin, 
vinculin, a CAS family protein, CRX, NCK, FAK, ILK, Src, Fyn, She, Grb-2, Guaning nucleotide 
exchange factors, SOS, DOCK 180,.Vav, Syk, P-l-3 kinase, AKT, Bad, Bid, Caspase 9, Cdc42, 
PAK, Rac, Rho, Rho kinase, Ras, Caveolin, Tetraspan, Receptor-type protein tyrosine 
phosphatase, SHP-2, Alpha-actinin, Filamin, Cytohesin, Beta3-endonexin, ICAP-1, RACK-1, CEB, 
10 actin, receptor tyrosine kinase, JRS-1 or IRS-2, and/or wherein interfering with an integrin 
signaling pathway comprises contacting the cancer cell with an agent that interferes with post- 
translational modification of integrins, and/or wherein the post translational modification is 
glycosylation or phosphorylation. 

Also disclosed are methods, wherein the integrin comprises an A6 integrin, and/or wherein 

1 5 the integrin comprises a B4 integrin, and/or further comprising reducing a laminin5-integrin 
interaction, and/or further comprising reducing a lamininS gamma2 integrin interaction, and/or 
wherein the cancer cell comprises normal p53, and/or wherein the proliferation of the cancer cells 
is not dependent on AKT/PKB, and/or wherein reducing the proliferation of the cancer cells is 
selective, and/or wherein the cancer cell is not an MDA-MB-435 cell, and/or wherein the cancer 

20 cell is not an HMT-3522 cell, and/or wherein the cancer cell is not an RKO colon carcinoma line, 
and/or wherein the cancer cell does not express exogenous B4 integrin, and/or further comprising 
contacting the cancer cells with a small molecule, peptide, peptide mimetic, or oligonucleotide or 
synthetic analog thereof, and/or wherein the cancer cells are contacted with dominant-negative 
beta 4 integrin, and/or wherein the cancer cell is contacted with an antisense molecule, and/or 

25 wherein the antisense molecule is linked to a leader sequence which enables translocation across a 
cell membrane, and/or wherein the leader sequence binds to a cell surface protein which facilitates 
internalization, and/or wherein the leader sequence is TAT or antennapedia, or fragment thereof, 
and/or wherein the antisense molecule is an alpha6 RNA antisense molecule, and/or wherein the 
small molecule peptide, peptide mimetic, or oligonucleotide or synthetic analog thereof is linked 

30 to a carrier, and/or wherein the carrier is at least one of a lipidic carrier, charged carrier, retroviral 
carrier, TAT or fragment thereof, antennapedia or fragment thereof, or polyethylene glycol, and/or 
further comprising contacting the cancer cell with another agent which modulates cell signaling, a 
chemotherapeutic drug, or treated with radiation or angiogenesis inhibitor, and/or wherein 
reducing the proliferation of cancer cell is in vitro, and/or wherein reducing the proliferation of the 

35 cancer cell is in vivo, and/or wherein the cancer cell is selected from the group consisting of 
melanoma, adenoma, lymphoma, myeloma, carcinoma, plasmocytoma, sarcoma, glioma, thyoma, 
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leukemia, skin cancer, retinal cancer, breast cancer, prostate cancer, colon cancer, esophageal 
cancer, stomac cancer, pancreas cancer, brain tumors, lung cancer, ovarian cancer, cervical cancer, 
hepatic cancer, gastrointestinal cancer, and head and neck cancer cells, and/or wherein the cancer 
cell is killed, and/or wherein the cancer cell expresses a mutated Ras, and/or wherein the cancer 
5 cell expresses a mutated Ras and a mutated p53, and/or wherein the cancer cell expresses a 
mutated Ras and activates the AKT/PKB protein, and/or wherein the cancer cell expresses a 
mutated Ras, a mutated p53, and activates the AKT/PKB protein, and/or wherein the cancer cell 
expresses a mutated APC, and/or wherein the cancer cell expresses a mutated Ras and mutated 
APC. 

1 0 Disclosed are methods of reducing the proliferation of a cancer cell in a patient which 

comprises administering to the patient a composition which inhibits ligand binding to an integrin 
receptor on the cancer cell, wherein the integrin receptor comprises an integrin. 

Also disclosed are methods of reducing the proliferation of a cancer cell in a patient which 
comprises administering to the patient a composition which reduces integrin- integrin interaction, 
15 integrin receptor clustering interaction, or integrin-non-integrin protein interaction. 

Also disclosed are methods of reducing the proliferation of a cancer cell in a patient which 
comprises administering to the patient a composition which reduces the production of an integrin 
or laminin by the cancer cell. 

Further disclosed are methods of reducing the proliferation of a cancer cell in a patient 
20 which comprises administering to the patient a composition which interfers with an integrin 
signaling pathway. 

Also disclsoed are methods, wherein the reduction in cancer cell proliferation is selective, 
and/or wherein the administering is local or systemic, and/or wherein the patient is additionally 
administered an agent which modulates cell signaling, a chemotherapeutic drug, or treated with 

25 radiation or angiogenesis inhibitor, and/or wherein the additional agent is administered serially or 
in combination, and/or wherein the local administering is direct application to cancer cells, and/or 
wherein the direct application to cancer cells is performed during surgery, and/or wherein the 
direct application to cancer cells is performed topically to cancerous tissue, and/or wherein the 
systemic administration is by subcutaneous, intraperitoneal, intra-arterial, intravenous, or bolus 

30 administration, or by application through a catheter or similar apparatus, and/or wherein the 
systemic administration comprises a long-term release formulation, and/or wherein the systemic 
administration is by oral administration, and/or wherein the oral administration comprises 
administering a pill, capsule, tablet, liquid or suspension. 

Disclosed are methods of reducing the proliferation of a cancer cell in a patient which 
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comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which inhibits ligand binding to integrin receptors on cancer cells wherein the coding sequence is 
under the control of a promoter which functions in mammalian cells. 

Also disclosed are methods of reducing the proliferation of a cancer cell in a patient which 
5 comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which prevents integrin receptor subunits from interacting with one another, prevents integrin 
receptor clustering interaction, or prevents integrin receptor subunits from interacting with other 
proteins on cancer cells wherein the coding sequence is under the control of a promoter which 
functions in mammalian cells. 

10 Further disclosed are methods of reducing the proliferation of a cancer cell in a patient 

which comprises administering to a patient a vector comprising coding sequence for a protein or 
peptide which prevents integrin receptor subunits from interacting with one another, prevents 
integrin receptor clustering interaction, or prevents integrin receptor subunits from interacting with 
other proteins in cancer cells wherein the coding sequence is under the control of a promoter which 

15 functions in mammalian cells. 

Also disclosed are method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which interferes with integrin subunit or laminin production wherein the coding sequence is under 
the control of a promoter which functions in mammalian cells. 

20 Further disclosed are methods of reducing the proliferation of a cancer cell in a patient 

which comprises administering to a patient a vector comprising coding sequence for a protein or 
peptide which interferes with an integrin signaling pathway of cells wherein the coding sequence is 
under the control of a promoter which functions in mammalian cells. 

Also disclosed are methods, wherein the reduction in cancer cell proliferation is selective, 
25 and/or wherein the vector is administered directly to a cancer cell, and/or wherein the vector is 
administered directly to a normal cell, and/or wherein the vector is packaged in a viral vector or 
liposome, and/or wherein the vector is a retroviral vector, and/or wherein the vector is 
administered systemically, and/or wherein the direct administration is by topical application, 
and/or wherein the direct administration is by topical application, and/or wherein the direct 
30 administration is performed during surgery, and/or wherein the direct administration is performed 
during surgery, and/or wherein the patient is an animal, such as a mammal, mouse, rabbit, primate, 
chimp, ape, goriilla, and human, and/or wherein the cancer cells are selected from the group 
consisting of melanoma, adenoma, lymphoma, myeloma, carcinoma, plasmocytoma, sarcoma, 
glioma, thyoma, leukemia, skin cancer, retinal cancer, breast cancer, prostate cancer, colon cancer, 
35 esophageal cancer, stomac cancer, pancreas cancer, brain tumors, lung cancer, ovarian cancer. 
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cervical cancer, hepatic cancer, gastrointestinal cancer, and head and neck cancer cells, and/or 
wherein the patient is additionally administered at least one of another agent which modulates cell 
signaling, a chemotherapeutic drug, an angiogenesis inhibitor or treated with radiation, and/or 
wherein the other agent which modi Ties cell signaling, chemotherapeutic drug, angiogenesis 
5 inhibitor or radiation treatment is administered serially or in combination. 

c) Methods of gene modification and gene disruption 
The disclosed compositions and methods can be used in targeted gene disruption and 
modification in any animal that can undergo these events. Gene modification and gene disruption 
refer to the methods, techniques, and compositions that surround the selective removal or 

10 alteration of a gene or stretch of chromosome in an animal, such as a mammal, in a way that 
propagates the modification through the germ line of the mammal. In general, a cell is 
transformed with a vector which is designed to homologously recornbine with a region of a 
particular chromosome contained within the cell, as for example, described herein. This 
homologous recombination event can produce a chromosome which has exogenous DNA 

15 introduced, for example in frame, with the surrounding DNA. This type of protocol allows for 
very specific mutations, such as point mutations, to be introduced into the genome contained 
within the cell. Methods for performing this type of homologous recombination are disclosed 
herein. 

One of the preferred characteristics of performing homologous recombination in 
20 mammalian cells is that the cells should be able to be cultured, because the desired recombination 
event occur at a low frequency. 

Once the cell is produced through the methods described herein, an animal can be 
produced from this cell through either stem cell technology or cloning technology. For example, if 
the cell into which the nucleic acid was transfected was a stem cell for the organism, then this cell, 

25 after transfection and culturing, can be used to produce an organism which will contain the gene 
modification or disruption in germ line cells, which can then in turn be used to produce another 
animal that possesses the gene modification or disruption in all of its cells. In other methods for 
production of an animal containing the gene modification or disruption in all of its cells, cloning 
technologies can be used. These technologies generally take the nucleus of the transfected cell and 

30 either through fusion or replacement fuse the transfected nucleus with an oocyte which can then be 
manipulated to produce an animal. The advantage of procedures that use cloning instead of ES 
technology is that cells other than ES cells can be transfected. For example, a fibroblast cell, 
which is very easy to culture can be used as the cell which is transfected and has a gene 
modification or disruption event take place, and then cells derived from this cell can be used to 

35 clone a whole animal. Conditional knockouts can also be made which will conditionally delete 
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expression of the desired molecule, for example, an A6 or integrin or the gamma2 chain of 
laminin5. 

d) Methods of diagnosing cancer 
Methods of diagnosing cancer using the disclosed information and the disclosed 
5 molecules. In particular disclosed are methdos of diagnoses that rely on the combined 
upregulation of both the ligand and the cogante integrin receptor or cognate integrins. It is 
understood that all of the methods of diagnosis disclosed herein, can be used with any of the 
disclosed compositions, but they also can be used in conjunction, where for example, both the 
ligand and the integrin receptor are monitored and correlated with cancer developing. 

10 Disclosed are methods of assessing a subject's risk of developing cancer comprising 

determining the amount of A6 present in a target cell obtained from the subject, wherein a 
determination of increased levels of A6 correlates with an increased risk of cancer. 

Disclosed are methods of assessing a subject's risk of acquiring cancer comprising 
determining the amount of B4 present in a target cell obtained from the subject, wherein a 
1 5 determination of increased levels of B4 correlates with an increased risk of cancer. 

Disclosed are methods of assessing a subject's risk of acquiring cancer comprising 
determining the amount of lamininS present in a target cell obtained from the subject, wherein a 
determination of increased levels of lamininS correlates with an increased risk of cancer. 

Disclosed are methods of assessing a subject's risk of acquiring cancer comprising 
20 determining the amount of gamma2 subunit present in a target cell obtained from the subject, 

wherein a determination of increased levels of gamma2 subunit correlates with an increased risk of 
cancer. 

Also disclosed are methods, further comprising comparing the amount A6 present to the 
amount in a control cell, and wherein determining the amount of A6 present in the target cell 

25 comprises assaying the amount of A6 mRNA in the cell, and.or wherein the assaying the amount 
of mRNA in the cell comprises hybridizing a A6 probe to a sample of the subject's mRNA, and/or 
wherein the assaying the amount of mRNA in the cell comprises hybridizing a A6 primer to a 
sample of the subject's mRNA, and/or wherein the assaying the amount of mRNA further 
comprises performing an nucleic acid amplification reaction involving the primer, and/or wherein 

30 the nucleic acid amplification reaction comprises reverse transcription, producing a cDNA, and/or 
wherein the nucleic acid amplification reaction further comprises performing a polymerase chain 
reaction on the cDNA, and/or wherein determining the amount of A6 present in the target cell 
comprises assaying the amount of A6 protein in the cell, and/or further comprising comparing the 
amount B4 present to the amount in a control cell, and/or further comprising comparing the 
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amount lamininS present to the amount in a control cell, and/or further comprising comparing the 
amount gamma2 subunit present to the amount in a control cell. 

C Examples 

Throughout this application, various publications are referenced. The disclosures of these 
5 publications in their entireties are hereby incorporated by reference into this application in order to 
more fully describe the state of the art to which this invention pertains. The references disclosed 
are also individually and specifically incorporated by reference herein for the material contained in 
them that is discussed in the sentence in which the reference is relied upon. 

It will be apparent to those skilled in the art that various modifications and variations can 
10 be made in the present invention without departing from the scope or spirit of the invention. Other 
embodiments of the invention will be apparent to those skilled in the art from consideration of the 
specification and practice of the invention disclosed herein. It is intended that the specification 
and examples be considered as exemplary only, with a true scope and spirit of the invention being 
indicated by the following claims. 

1 5 The following examples are put forth so as to provide those of ordinary skill in the art with 

a complete disclosure and description of how the compounds, compositions, articles, devices 
and/or methods claimed herein are made and evaluated, and are intended to be purely exemplary of 
the invention and are not intended to limit the scope of what the inventors regard as their 
invention. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, 

20 temperature, etc.), but some errors and deviations should be accounted for. Unless indicated 
otherwise, parts are parts by weight, temperature is in °C or is at ambient temperature, and 
pressure is at or near atmospheric. 

1. Example 1 Colonic epithelial cells dependent on alpha6beta4 receptor 
si gnal transduction for growth in the absence of ECM 

25 Integrin alpha6 and integrin beta4 and laminin gamma2 chains are essential for tumor cell 

survival in vivo. The data disclosed herein indicate that interfering with integrin alpha6-mediated 
signaling, integrin beta 4signaling, and laminin gamma2 signaling can constitute an effective 
approach to induce programmed ceil death in cancer cells without damaging normal cells. 
Ablation of integrin alpha6-dependent signaling, integrin beta4-dependent signaling, and laminin 

30 gamma2- dependent signaling in human cancer cell lines supports this. 

The murine colonic epithelial transformation system introduced by D' Abaco et al. (1996) 
was used. In this system, control cells (control), cells containing an activated ras oncogene (Ras), 
a deletion in the APC gene (APCmin) or both alterations together (Ras+APCmin) can be compared 
with regard to their proliferation characteristics in tissue culture. The cells were derived from 
35 transgenic mice containing a temperature-sensitive allele of SV40 large T under the control of a 
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gamma interferon-inducible promoter permitting conditional immortalization (Jat et aL y 1991). 
All experiments were carried out with the cells being kept at non-permissive temperature in the 
absence of gamma interferon. As shown previously (D'Abaco et a/., 1996), only the cells carrying 
activated Ras and the APCmin mutation were able to form colonies in soft agar in the absence of 
5 anchorage to a substratum. All other cell populations did not give rise to colonies under these 
conditions. 

The frequency of tunel-staining cells (Fig.2A) and caspase 3 activity (Fig. 2B) in control, 
Ras, APCmin and Ras+APCmin colonic epithelial cells (D'Abaco et al., 1 996) in suspension was 
measured. Both assays show a strong suppression of apoptosis in Ras-APCmin cells, indicating 
10 that Ras and APCmin co-operate in preventing cell death in the absence of apparent ECM contacts. 
Activated Ras can protect cells from apoptosis via activation of the serine-threonine kinase AKT 
(Kauffmann-Zeh et al., 1997; Khwaja et al. t 1997). However, both Ras and Ras+APCmin cells 
show equivalent levels of AKT phosphorylation (not shown), eliminating AKT as the relevant 
target of Ras+APCmin cooperation. 

1 5 Although cancer cells have been thought to survive independently of integrin signaling, 

they frequently express high levels of alpha3 or alpha6 integrin receptors and/or their ligands 
(Dedhar et a/., 1993; Kennel et aL, 1989; Koshikawa et al., 1999; Lohi et aL, 2000; Nejjari et al., 
1999; Van Waes and Carey, 1992). These integrins are suspected to play a role in invasion and 
metastasis (Mukhopadhyay etai, 1999; Shaw etai, 1997). Ras and APCmin mutations would 

20 induce alterations in the mRNA expression patterns of integrin and ECM components. Induction 
of alpha6 integrin alone, was specific to Ras+APCmin cells (Fig. 3 A). In addition, we found 
elevated expression of the laminin alpha5, gamma2 and beta2 chains induced in Ras and 
Ras+APCmin cells (Fig. 3B)> These chains arc components of laminins 5, 10 and 1 1 (Malinda 
and Kleinman, 1996), which are ligands of integrin receptors alpha6/beta4 and 

25 alpha3/betal (Kikkawa et a/., 2000; Niessen et ai 3 1 994). These results indicate that Ras and 
APCmin mutations can cooperate to induce autocrine activation of integrin receptors. 

Alpha6 integrin can form functional receptors together with either betal or beta4 integrins 
(Clark and Brugge, 1995), leading to signaling through downstream effectors such as focal 
adhesion kinase (FAK) or the SH2 domain adapter She (Giancotti and Ruoslahti, 1999). 

30 Conversely, beta4 integrin only binds to alpha6 integrin (Clark and Brugge, 1995). In transformed 
colonic epithelial cells alpha6 integrin functions in conjunction with integrin beta4 and is engaged 
by the laminin gamma2 chain to activate She (see Fig. 4). Consistent with alpha6 integrin 
induction, only Ras+APCmin cells can bind a laminin gamma2-specific peptide in an alpha6 and 
beta4 integrin-specific manner. In contrast, an alpha3/betal and alpha6/betal -specific peptide 

35 derived from the laminin gammal chain, only binds to control cells (Fig. 4A). In addition, only in 
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Ras+ APCmin cells is She phosphorylated in response to clustering of the integrin alpha6fteta4 
receptor by beta4-specific antibodies (Mainiero et a/., 1995) and by the laminin gamma2 -specific 
peptide (Fig. 4B). In both cases p52Shc j s the major phosphorylated form.. Moreover, alpha6 
integrin is required for She activation, as She phosphorylation is inhibited in response to 
5 expression of a dominant-negative beta4 integrin mutant. This mutant lacks the cytoplasmic 
signaling domain but selectively binds to alpha6 integrin to form a ligand binding, yet signaling- 
defective alpha6/beta4 integrin (Spinardi et a/., 1993) (Fig. 4B) 

Anti-sense RNAs specific for alpha6 integrin and gamma2 laminin, as well as dominant- 
negative beta4 integrin, were expressed and results indicated that integrin alpha6, integrin beta4, 

10 and laminin gamma2 expression are relevant for the survival of Ras+ APCmin cells in 

Ras+ APCmin, Ras, APC and control cells. All three constructs efficiently inhibit the growth of 
Ras+APCmin cells in soft agar (not shown) and when attached to plastic (Fig. 5 Al). In contrast, 
the proliferation of wt, Ras and APCmin cells is not affected by this treatment (Fig.5Al). 
Importantly, the colony formation of Ras+APC cells exposed to any of the three inhibitory 

1 5 constructs can be rescued by co-expression of baculovirus p35, a potent inhibitor of caspase 
activity and of apoptosis (Resnicoff et ai, 1998) (Fig. 5A1). Similarly, co-expression of 
exogenous integrin alpha6 mRNA efficiently rescues Ras-APCmin cell proliferation inhibited by 
alpha6 anti-sense RNA (Fig. 5A1). This is mirrored by a rescue of alpha6 integrin expression on 
the cell surface of alpha6 anti-sense RNA expressing cells (Fig. 5B). Moreover, ectopic alpha6 

20 integrin expression also rescues apoptosis induced by dominant-negative beta4 integrin (Fig. 
5A1). The latter is expressed at constant levels (Fig. 5B). As one would expect, the cells 
expressing laminin gamma2 antisense RNA could not be rescued by integrin alpha6 over- 
expression, but showed significant rescue when plated on dishes coated with laminin gamma2 
peptide (Fig. 5A1). Similarly, this peptide increases the survival of Ras+ APCmin cells in 

25 suspension, as indicated by lower caspase 3 activity (Fig. 2C). In summary, the survival of Ras- 
APCmin cells depends on the expression of alpha6/beta4 integrin and the laminin gamma2 chain. 
Conversely, the inhibition of these gene activities leads to selective killing of transformed cells. 

Data also indicate that cell death due to lack of alpha6/beta4 integrin receptor signaling in 
Ras-APCmin cells is independent of the death signaling pathway involving the induction of 
30 mitochondrial damage and caspase 9 activity. Instead, the data indicate an involvement of the 
Fas/TNF receptor/death domain protein/caspase8 pathway (Kruidering and Evan, 2000) in the 
control of tumor cell survival by alpha6 integrin-containing integrin receptors. 

Control, APCmin and Ras cells, which lack alpha6^eta4 integrin receptor signaling 
activity, show high levels of caspase 8 activity when kept in suspension, while at the same time 
35 caspase 9 activity cannot be detected (Fig. 2D). 
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Although Ras- APCmin cells can be rescued from apoptosis by caspase inhibitor 
baculovirus p35 (Fig. 5 Al, A2 and text above), cell death induced by expression of dominant- 
negative beta4 integrin* cannot be prevented by expression of the survival factor Bcl2 (see Fig. 5 
A2). Bcl2 binds to and neutralizes BH3-domain killer proteins that cause mitochondrial damage, 
5 cytochrome C release and caspase 9 activation (Luo et al., 1998). 

Ras cells which lack alpha6/beta4 integrin receptor signaling activity and cannot survive in 
the absence of ECM contacts, and Ras+APCmin cells which depend on alpha6/beta4 integrin 
receptor signaling for survival, show equivalent levels of AKT phosphorylation (data not shown). 
This indicates that AiCT does not serve as the key target for alpha6/beta4 integrin receptor 
10 signaling. AKT has been described to promote cell survival via phosphorylation and inacrivation 
of Bad, a BH3-domain killer protein (Datta et a/., 1999). 

2. Example 2 Other cancer cells dependent on alpha6beta4 receptor signal 
transduction for growth in the absence of ECM 

The survival of highly transformed primary mouse embryo fibroblasts expressing Ras and 

15 Myc oncoproteins in conjunction with a homozygous ARF null mutation (Kamijo et al., 1997) 

depends on expression and function of alpha6 integrin (Fig. 5C), thus demonstrating that this 

principle also may apply to transformed mesenchymal cells. Moreover, the human colon cancer 

cell line SW480 carrying multiple oncogenic mutations such as activated Ras (Fujita et al., 1988), 

amplified c-Myc (Cherif et al., 1988), a mutated APC allele (Munemitsu et al. s 1995) and a p53 

20 mutation (Abarzua et al., 1995) is effectively killed by anti-sense integrin alpha6 and laminin 

gamma2 RNAs as well as dominant-negative beta4 integrin (Fig. 4D). Similarly, integrin alpha6 

ablation even leads to apoptosis of Ras/APCmin cells in the presence of active SV40 large T (not 

shown). Thus the survival of different types of highly transformed cells depends on alpha6 

integrin expression, beta4 integrin expression and lamininS expression irrespective of the status of 

25 major tumor supressor genes, such as arf, p53 or rb, and for at least sw480 cells, murine colon 

epitheilial cells, and the ras/APCmin cells, alpha6, beta4, and Iaminin5/gamma2 are required. The 

sensitivity of transformed cells to ablation of integrin signaling, for example, A6B4 integrin 

receptor signaling, is thus quite remarkable in its apparent generality. 

Oncogenic mutations co-operate to engage autocrine integrin signaling. Importantly, this 
30 signaling mechanism, involving alpha6 integrin, beta4 integrin and the laminin garnrna2 chain, 
becomes an essential component of the survival mechanism in transformed colonic epithelial cells. 
In addition, fibroblasts and human colon cancer cells also rely on integrin alpha6 for survival. In 
contrast, normal and partially transformed cells that express alpha6 integrin at low levels do not 
require this polypeptide for survival. Thus, integrin signaling inhibition can lead to selective 
35 killing of cancer cells. 
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The laminin-integrin receptor signaling loop is also relevant to the survival of cells 
transformed by other combinations of oncogenic lesions. Introduction of activated Ras together 
with dominant-negative p53 (Lloyd et al., 1997) into murine colonic epithelial cells (D'Abaco et 
al„ 1996; see also above) via retroviral infection supports this. Four distinct polyclonal pools of 
5 infected cells were derived by drug selection: I) Control (beo/neo) cells, infected with two 
retroviruses carrying neomycin (neo) or bleomycin (bleo) drug resistance markers, respectively; 
2)Ras cells, infected with a Ras/neo virus and a virus with the bleo marker; 3) DNp53 cells, 
infected with a DNp53/bleo virus and a virus with the neo marker; and 4) Ras/DNp53 cells, 
infected with a Ras/neo virus and a DNp53/bleo virus. 

10 As the colon epithelial cells contain temperature-sensitive SV40 large T under control of 

the gamma interferon promoter, all cell populations were drug-selected at the permissive 
temperature (33°C) in the presence of gamma interferon. All further experiments were carried out 
at the non-permissive temperature for SV40 large T in the absence of gamma interferon, as 
described above for Ras+APCmin cells. 

1 5 Similar to Ras+APCmin cells, only Ras/DNp53 cells but neither bleo/neo, Ras/bleo or 

DNp53/neo cells were able to grow in the absence of ECM contact in soft agar (not shown). 
Moreover, Ras and DNp53 cooperate in the suppression of caspase 3 activity in suspension (Fig. 
2B), and in the induction of alpha6 integrin expression (Fig. 4A). In Ras/DNp53 cells passaged 
through a single round of soft agar growth, alpha6 integrin is expressed at even higher levels, 

20 demonstrating a correlation between the ability of the cells to survive in the absence of ECM 
contact and alpha6 integrin expression. In Ras/DNp53 cells beta4 integrin expression levels are 
also increased, when compared to controls (Fig. 4A). As expected expression of the laminin 
gamma2 chain is induced in Ras/bleo and Ras/DNp53 cells (Fig. 4B). 

Anti-sense RNAs specific for alpha6 integrin and gamma2 laminin, as well as dominant- 
25 negative beta4 integrin were expressed in Ras/DNp53, Ras/bleo, DNp53/neo and neo/bleo cells. 
All three constructs efficiently inhibit the growth of Ras/DNp53 cells when attached to plastic. In 
contrast, the proliferation of neo/bleo, Ras/bleo and DNp53/neo cells is not affected by this 
treatment (Fig.5 A3). Anti-sense alpha6 integrin or dominant-negative beta4 integrin also inhibit 
the growth of Ras/DNp53 cells in soft agar (Fig. 5A4). 

30 Integrin alpha6 expression can also be induced by activated Ras and DNp53 in primary murine 
colon crypt epithelial cells (not shown). Furthermore introduction of Ras and Myc oncoproteins 
into our control colon epithelial cells also leads to an induction of alphaS expression (not shown), 
suggesting that induction of alpha6 integrin expression may be an integral component of distinct 
oncogene cooperation paradigms 



74 



WO 02/30465 



PCT/US01/32127 



a) Cell types to be tested 

As described herein, there are a variety of cancer and transformed cell types that express 
alpha6 integrin and laminins at high levels (Dedhar etal. t 1993; Koshikawa et al., 1999; Lohi et 
at., 2000; Van Waes and Carey, 1992). As disclosed herein the following cell lines other than the 
5 fibroblasts require alpha6 and beta4 integrin expression, as well as laminin gamma2 chain 
expression: (1) the human colon carcinoma cell line SW480 (data not shown), (2) Ras+APCmin- 
transformed murine colon epithelial cells (see Fig. 3A) Murine fibroblasts transformed by 
activated Ras and Myc in conjunction with homozygous Arf null mutation require alpha6 
expression for their survival. These cells do not express integrin beta4, but express betal integrin, 
10 indicating the importance of alpha6/betal receptors in the transformed fibroblasts. 

b) xxxlnducible expression of inhibitors 

To interrupt alpha6 integrin and laminin gamma2 chain-dependent signaling in tumors 
regulatable alpha6 integrin and laminin gamma2 antisense mRNA expression as well as 
regulatable expression of the beta4 integrin the dominant-negative polypeptide described in 
15 Section C were used. Constitutive expression of these inhibitors leads to rapid cell death in tissue 
culture. In contrast, the establishment of clonal cell lines with inducible expression of anti-sense 
mRNAs or dominant-negative beta4 integrin minimize such limitations. 

The doxycycline (dox)-inducible reverse tetracycline transactivator (rtTA) is frequently 
used to overexpress transgenes in a temporally regulated fashion in vitro and in vivo (Efrat et ai, 

20 1995; Gossen et al. f 1995; Ray et a/., 1997). These systems are, however, often compromized by 
the levels of gene expression in the absence of dox administration. The tetracycline controlled 
transcriptional silencer (tTS), a fusion protein containing the tet repressor and the KRAB-AB 
domain of the kid-1 transcriptional repressor, is inhibited by doxycycline. As shown in tissue 
culture (Freundlieb et a/., 1999) and in transgenic mice (Zhu et al. t 2001), tTS tightens the control 

25 of transgene expression in rtTA-based systems, i.e. tTS effectively eliminates leaky baseline 
expression without altering the inducibility of rtTA-regulated genes. Thus optimal "of&on" 
regulation of gene expression can be accomplished with the combined use of tTS and rtTA* The 
complete expression system is commercially available from CLONTECH. This system can be 
used for the preparation of all inducible cell lines. 

30 Another regulatable system comprising the estrogen-dependent transactivator GalER- 

VP16 and a promoter under the control of Gal4 DNA binding sites (Braselmann et al., 1993) can 
be used. (Perez-Roger et at., 1997). This particular experimental set up, was used herein to show 
that the induction of the beta4 integrin dominant-negative mutant in SW480 colon cancer cells 
induces apoptosis in vitro, as measured by caspase 3 activation (Fig. 2B). For in vivo use, 

35 however, a point mutation has to be introduced into the GalER-VP16 transactivator that eliminates 
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its sensitivity to estrogen while retaining its response to the anti-estrogen 40H-tamoxifen 
(Littlewood et al, 1995). 40H-tamoxifen has been demonstrated to reversibly regulate in vivo the 
activity of another regutatable transactivator, the MycER chimera (Pelengaris et al., 1999). 

Typically, inducible cell lines are first stably transfected with the anti-sense or dominant- 
5 negative constructs coupled to the regulatable promoters into the four test cell lines. Subsequently, 
the activator is introduced and the repressor, such as rcTA and tTS (see above) via infection using 
recombinant retroviruses with different selectable markers 
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E. Sequences 

1. SEP ID NO:1 Human alpha6 integrin cds (acc# 4557674) 

50 atggccgccgccgggcagctgfcgcttgctctacctgfccggcggggctcctgtcccggctcggcgcagccttcaacfctggac 
actcgggaggacaacgtgatccggaaatatggagaccccgggagcctcttcggcttctcgctggccatgcactggcaactg 
cagcccgaggacaagcggctgttgctcgtgggggccccgcgcggagaagcgcbtccactgcagagagccaacagaacggga 
gggctgtacagcfcgcgacatcaccgcccgggggccatgcacgcggabcgagttfcgataacgatgcfcgaccccacgtcagaa 
agcaaggaagatcagbggabgggggtcaccgbccagagccaaggfcccagggggcaaggbcgbgacatgbgcbcaccgatab 

55 gaaaaaaggcagcabgttaatacgaagcaggaabcccgagacabcbbtgggcggbgttafcgbcctgagbcagaabctcagg 
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atbgaagacgatabggatgggggagattggagctbttgtgatgggcgattgagaggccatgagaaatttggctcttgccag 
caaggbgbagcagcbacbbbtacbaaagacbbbcabbacabbgbabbbggagccccgggbacbbabaacbggaaagggat c 
gtbcgfcgbagagcaaaagaataacacttbtbbbgacabgaacatcttbgaagabgggccbtatgaagbbggtggagagact 
gagcabgabgaaagbcbcgbbccbgbbccbgcbaacagbbacbbaggbbcbbcbtbggacbcagggaaaggbabbgbbbcb 
5 aaagabgagatcactbbbgtabcbggbgcbcccagagccaabcacagbggagccgbggbbbbgccgaagagagacabgaag 
bcbgcacabctccbcccbgagcacatabbcgabggagaaggtctggccbcbtcatbbggctabgabgbggcggtggbggac 
ctcaacaaggabgggtggcaagatabagtbabbggagccccacagtabttbgatagagabggagaagbtggaggtgcagtg 
tatgtctacabgaaccagcaaggcagabggaabaabgbgaagccaabtcgtcbbaabggaaccaaagabbcbatgttbggc 
abbgcagbaaaaaabatbggagababbaabcaagabggcbacccagababbgcagbbggagcbccgbabgabgacbbggga 

10 aaggbbtfcbabcbabcabggabcbgcaaabggaabaaabaccaaaccaacacaggbbcbcaagggtababcaccbtabbbb 
ggababbcaabtgcbggaaacabggaccbbgabcgaaatbccbacccbgabgbbgctgttggbbcccbcbcagatbcagba 
acbabbtbcagabcccggcctgbgafcbaabatbcagaaaaccabcacagbaacbccbaacagaacbgacctccgccagaaa 
acagcgtgbggggcgccbagbgggabatgcctccaggtbaaabcctgbbbbgaababacbgctaaccccgcbggbtabaat 
ccbbcaababcaabbgbgggcacacbbgaagcbgaaaaagaaagaagaaaabcbgggcbabcctcaagagtbcagtttcga 

15 aaccaaggttctgagcccaaababactcaagaacbaacbctgaagaggcagaaacagaaagbgbgcabggaggaaacccbg 
bggcbacaggabaababcagagataaacbgcgbcccabbcccabaacbgccbcagtggagabccaagagccaagcbcbcgb 
aggcgagbgaabbcacbbccagaagttcbbccaabbcbgaabbcagatgaacccaagacagcbcababbgabgbbcacttc 
bbaaaagagggatgtggagacgacaabgbafcgbaacagcaacctbaaacbagaababaaabbbbgcacccgagaaggaaab 
caagacaaattbbctbabbbaccaabbcaaaaaggbgbaccagaactagbbcbaaaagabcagaaggababbgcbbbagaa 

20 abaacagbgacaaacagcccttccaacccaaggaabcccacaaaagatggcgabgacgcccabgaggcbaaactgabtgca 
acgtbtccagacacbbbaaccbabbctgcababagagaacbgagggcbttccctgagaaacagbbgagbbgbgbbgccaac 
cagaabggcbcgcaagctgacfcgtgagctcggaaabcctbbtaaaagaaabtcaaatgfccactttbbabbbggbbtbaagt 
acaacbgaagfccaccbbbgacaccccabafccbggatabbaafccbgaagbbagaaacaacaagcaabcaagabaatbbggcb 
ccaabbacagcbaaagcaaaagfcggbbattgaacbgcbbbbabcggbcbcgggagtfcgctaaacctbcccaggtgbabbbb 

25 ggaggbacagfcbgbbggcgagcaagcbabgaaabcbgaagabgaagbgggaagtfcbaabagagbabgaabbcagggbaaba 
aacbbaggbaaaccbcbbacaaaccbcggcacagcaaccfctgaacabbcagbggccaaaagaaabbagcaabgggaaabgg 
bbgcbbbabbbggbgaaagtagaabccaaaggabtggaaaaggbaacbtgbgagccacaaaaggagabaaacbccctgaac 
ctaacggagbctcacaacbcaagaaagaaacgggaaabbactgaaaaacagabagabgataacagaaaabbttcbbbabtt 
gctgaaagaaaabaccagactcbbaacbgbagcgbgaacgbgaacbgbgbgaacabcagabgcccgcbgcgggggcbggac 

30 agcaaggcgtcbctbabbbbgcgctcgaggbtabggaacagcacabbtcbagaggaatattccaaacbgaacbactbggac 
abbcbcabgcgagccbtcabtgabgbgactgcbgcbgccgaaaababcaggcbgccaaabgcaggcacbcaggbbcgagbg 
acbgbgbtbcccbcaaagactgbagctcagbabbcgggagbaccbtggtggabcatcctagbggcbabbcbcgcbgggatc 
btgabgcbtgcbbtabbagtgbbbabacbabggaagbgbggbbbcbbcaagagaaabaagaaagabcabbabgabgccaca 
batcacaaggcbgagabccabgcbcagccatcbgabaaagagaggcbbacbtctgatgcabag 

35 2. SEP ID NO:2 Human integrin alpha6 protein sequence (acc# 

NP 000201): 

1 maaagqlcll ylsagllsrl gaafnldbre dnvirkygdp galfgfalam hwqlqpedkr 
61 lllvgaprge alplqranrb gglyscdiba rgpcbriefd ndadpbsesk edqwmgvbvq 
121 sqgpggkwb cahryekrqfc vnbkqesrdi fgrcyvlsqn Irieddmdgg dwsfcdgrlr 

40 181 ghekfgscqq gvaabfbkdf hyivfgapgb ynwkgivrve qfcnnbffdmn ifedgpyevg 
241 gebehdeslv pvpansylgf sldsgkgivs kdeibfvsga pranhsgaw llkrdmksah 
301 llpehifdge glassfgydv awdlnkdgw qdivigapqy fdrdgevgga vyvyninqqgr 
361 wnnvkpirln gbkdsmfgia vknigdinqd gypdiavgap yddlgkvfiy hgsanginbk 
421 pbqvlkgisp yfgyaiagnm dldrnsypdv avgslsdavb ifrarpvini qkbibvbpnr 

45 481 idlrqkbacg apagiclqvk scfeybanpa gynpsisivg bleaekerrk sglssrvqfr 
541 nqgsepkytq eltlkrqkqk vcmeeblwlq dnirdklrpi pibasveiqe pssrrrvnsl 
601 pevlpilnsd epktahidvh Clkegcgddn vcnsnlkley kfcbregnqd kfsylpiqkg 
661 vpelvlkdqk dialeibvtn spsnprnpbk dgddaheakl iabfpdtlty sayrelrafp 
721 ekqlscvanq ngsqadcelg npfkrnsnvb fylvlatbev bfdtpyldin lklebtsnqd 

50 781 nlapibakak wielllava gvakpsqvyf ggbwgeqam kaedevgali eyefrvinlg 
841 kpltnlgbat lniqwpkeia ngkwllylvk veskglekvt cepqkeinal nlbeahnark 
901 kreibekqid dnrkfalfae rkyqtlncav nvncvnircp lrgldskaal ilrarlwnsb 
961 fleeyaklny Idilmrafid vbaaaenirl pnagbqvrvt vfpskbvaqy agvpwwiilv 
1021 ailagilmla llvCilwkcg ffkrnkkdhy dabyhkaeih aqpsdkerlc sda 

55 3, SEP IP NO:3 Mus musculus alpha6 integrjn cds (acc# 71 10658); 

abggccgbcgcgggccagbtgbgcctgcbcbaccbgbccgcggggcbbcbagcccggcbgggtacagccbtcaacctggac 
acccgcgaggacaacgtgabccggaaabcgggggatcccgggagcctctbcggctbctcgctcgccatgcactggcagbbg 
cagccggaggacaagcggcbgbbgctbgtgggggcaccbcgggcagaagcacbcccgcbgcagagggcgaacagaacaggg 
ggccbgbacagcbgbgacabcacctcccgaggaccbbgbacacggafctgaabbbgabaabgacgcbgatccbabgbcagaa 

60 agcaaggaagaccagbggabgggagbcacbgbccagagccaaggbccagggggcaaagbggbgacgbgbgcacabcgabab 
gagaaacggcagcacgtcaacacgaagcaggagbcgcgggababcbtbggaagabgbbatgtccbgagbcagaabcbcaga 
ab bgaaga fcgababggacggaggagacbggagbbbcbgcgabggccggb bgagaggccabgaaaagbb bggc bccbgbcag 
caaggagbagcggcbacbtbcacbaaggacbtbcatbacattgbbfcbbggagccccagggacbbacaacbggaaagggabc 
gbccgbgtagaacaaaagaataacacbbbbbtbgacabgaacatcbbbgaagatgggcccbabgaagbtggbggagagaca 

65 gatcatgabgaaagtctcgtgcccgtbccfcgcbaacagtbaccbaggcbtttcgctggacbcagggaagggbabtgbbtct 
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aaagabgacabcactbbbgtgbctggbgctccaagagccaabcacagtggggcbgtagbgbbgcbaaaaagagacabgaag 
bccgcacatctgctcccbgagbababatbbgacggagaaggccbggcbbccbcgfcbbggctabgabgtggcagbggtggac 
cbcaabgcagabgggbggcaagacatcgttabcggagctccacagtattbbgabagggabggtgaagtcgggggtgcagbt 
tacgtctacattaaccagcaaggcaaatggagtaatgtgaagccgattcgtctaaatgggaccaaagactcgatgtttgga 
5 atctctgbgaaaaatatag'gtgatabtaaccaagabggcbatccagatattgctgttggagcbccctatgatgabctgggg 
aaggtttbbatctabcatggatccccgacbggcabaabbaccaagccaacacaggttctcgaggggacabcgccbtactbc 
ggcbabtcaabcgcbgggaababggacctggatcggaabbccbaccccgaccttgctgbgggctccctcbcagacbcggbc 
actabttbcagabcccggccagbgattaacabbcbaaaaaccabcacagtgactcctaacagaabtgacctccgccagaag 
bccabgbgbggcbcaccbagcgggababgccbcaaggbbaaagccbgbbfcbgaatabacbgcgaaaccbbccggbbataac 

10 cctccaabatcaabttbgggtattctbgaagctgaaaaagaaagaagaaaatcagggttgtcatcgagagttcagtttcga 
aaccaaggttccgagccaaagbatactcaggagctgacccbgaatcggcagaagcagcgggcgtgcatggaggagacccbc 
cggctgcaggagaacatcagagacaagctgcgbcccatccccabcacggcbtctgcggagatccaggagcccacgtctcgc 
cggcgggbgaactcactccccgaagtbcttcccafccctgaabtcaaabgaagccaaaacggbccagacagatgbccacbbc 
btaaaggaaggabgtggagacgacaatgtctgtaacagcaacctbaagctagagtabaaatbtggtacccgagaaggaaab 

15 caagacaaabtctcbtaccbtccaabtcaaaaaggcatcccagaabtagtcctaaaagabcagaaagabatagctctggaa 
ataacggbgaccaacagcccttcggatccaaggaatccccggaaagatggcgacgabgcccatgaagccaaacbcatcgcc 
acgtbbccagacactcbgacatattccgcbtacagagaactgagggctttccctgagaagcagctgagctgbgcggccaac 
cagaatggctcccaagccgactgtgagctcggaaatccttbcaagagaaattccagbgttactttctatctgattttaagt 
acaaccgaggbcaccttbgacaccacagatctggabattaatctgaagttggaaacaacaagcaatcaggabaaattggct 

20 ccaattacagcgaaggcaaaagtggtbattgaabtgcttttatccctcbccggagbcgctaagcctbcgcaggtgtattbb 
ggaggtacagttgttggbgagcaagctatgaaatctgaagatgaagfcaggaagtttaatagagbatgaatbbagggtgatt 
aacttaggcaagcctcbtaaaaacctcggcacagcaaccttgaatatacagtggcccaaggagattagcaatggcaaatgg 
btgctbtabbbgatgaaagbtgaabccaaaggttbggagcagattgtbbgbgagccacacaabgaaataaactacctgaag 
cbgaaggagbcfccacaactcaagaaagaaacgggaacttcctgaaaaacagatagatgacagcaggaaactttctttattt 

25 cctgaaagaaaataccagactctcaactgcagcgtcaacgtcaggbgtgfcgaacatcaggtgcccactgcgagggctggac 
acgaaggcctctctcgbtcfcgtgttccaggttgtggaacagcacabttctagaggaatabtccaaactgaactacttggac 
attctcgbgagggcttccabagatgtcaccgctgctgctcagaatatcaagctcccbcacgcgggcactcaggttcgagbg 
acggfcgbbtcccbcaaagactgtagctcagtabtcaggagbagcbtggbggatcabcctcctggcbgfcbcttgccgggatb 
cbgatgctggctctabbagtgttbbbactgtggaagtgtggcttcbtcaagagaaabaagaaagatcabbacgabgccacc 

30 batcacaaggcbgagabccabactcagccgbctgabaaagagaggcbtacttccgabgcatag 

4. SEP ID NO: 4 Mus musculus alpha6 integrin protein sequence (acc# 
frP_ Q32423) 

1 mavagqlcll ylsagllarl gfcafnldbre dnvirksgdp gslfgfslam hwqlqpedkr 
61 lllvgaprae alplqranrt gglyscdits rgpcbriefd ndadpmsesk edqwmgvtvq 

35 121 sqgpggkwb cahryekrqh vntkqesrdi fgrcyvlsqn lrieddmdgg dwsfcdgrlr 
181 ghekfgscqq gvaatfbkdf hyivfgapgt ynwkgivrve qknnbf fdmn ifedgpyevg 
241 getdhdeslv pvpansylgf sldsgkgivs kddibfvsga pranhsgaw llkrdmksah 
301 llpeyifdge glassfgydv awdlnadgw qdivigapqy fdrdgevgga vyvyinqqgk 
361 wsnvkpirln gtkdsmfgis vknigdinqd gypdiavgap yddlgkvf iy hgsptgiitk 

40 421 pbqvlegbsp yfgysiagnm dldrnsypdl avgslsdsvb ifrsrpvini lkfcitvtpnr 
481 idlrqksmcg spsgiclkvk acfeybakpe gynppisilg ileaekerrk aglssrvqfr 
. 541 nqgsepkytq elblnrqkgr acmeetlwlq enirdklrpi pibasveiqe ptsrrrvnsl 
601 pevlpilnsn eaktvqtdvh flkegcgddn vcnsnlkley kfgtregnqd kf sylpiqkg 
561 ipelvlkdqk dialeitvtn spsdprnprk dgddaheakl iatfpdtlty sayrelrafp 

45 721 ekqlscvanq ngsqadcelg npfkmssvt fylilsttev tfdttdldin lkletbsngd 
781 klapibakak wielllsls gvakpsqvyf ggtwgeqam ksedevgsli eyefrvinlg 
841 kplknlgbat Iniqwpkeis ngkwllylmk veskgleqiv cephneinyl klkeshnsrk 
901 krelpekqid dsrkfslfpe rkyqblncsv nvrcvnircp Irgldtkael vlcsrlwnsb 
961 fleeysklny Idilvrasid vbaaaqnikl phagbqvrvb vCpsktvaqy sgvawwiill 

50 1021 avlagilmla llvfllwkcg ffkrnkkdhy dafcyhkaeih tqpsdkerlb sda 

5. SEP ID NP:5 Human integrin beta4 subunit cds facc# 6453379): 

abggcagggccacgccccagcccabgggccaggctgctccbggcagcctbgabcagcgbcagccbcbcbgggaccbtgaac 
cgctgcaagaaggccccagtgaagagcbgcacggagtgtgtccgtgtggataaggacbgcgcctactgcacagacgagatg 
btcagggaccggcgctgcaacacccaggcggagcbgcbggccgcgggcbgccagcgggagagcabcgtggbcatggagagc 

55 agcttccaaatcacagaggagacccagattgacaccaccctgcggcgcagccagabgbccccccaaggccbgcgggbccgb 
cbgcggcccggbgaggagcggcabttbgagcbggaggbgbbbgagccacbggagagccccgbggaccbgbacabcctcabg 
gacbtctccaactccabgbccgabgabctggacaaccbcaagaagabggggcagaaccbggcbcgggbcctgagccagctc 
accagcgacbacacfcatbggabbbggcaagbbbgtggacaaagbcagcgbcccgcagacggacabgaggccbgagaagctg 
aaggagccbbggcccaacagbgacccccccbbcbccbtcaagaacgbcatcagcctgacagaagabgbggatgagttccgg 

60 aabaaacbgcagggagagcggabctcaggcaaccbggabgcbccbgagggcggcbtcgatgccatccbgcagacagctgtg 
tgcacgagggacattggcbggcgcccggacagcacccacccgctggtcbbcbccaccgagbcagccttccactatgaggct 
gatggcgccaacgbgcbggctggcabcatgagccgcaacgabgaacggfcgccacctggacaccacgggcaccbacacccag 
bacaggacacaggacbacccgbcggbgcccacccfcggbgcgccfcgcfccgccaagcacaacabcabccccabctbbgcbgtc 
accaacbacbccbabagcbactacgagaagcbbcacaccbabbtcccbgbcbccbcacbgggggbgcbgcaggaggactcg 

65 tccaacabcgtggagctgcbggaggaggccbbcaabcggabccgcbccaaccbggacatccgggcccbagacagcccccga 
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ggccttcggacagaggbcacctccaagabgtbccagaagacgaggacbgggbccbfcbcacatccggcggggggaagtgggt 
atataccaggtgcagctgcgggcccbtgagcacgtggatgggacgcacgtgtgccagctgccggaggaccagaagggcaac 
abccabctgaaaccbbccbbcbccgacggccbcaagabggacgcgggcabcabctgbgabgbgbgcacctgcgagcbgcaa 
aaagaggbgcggbcagctcgcbgcagcbbcaacggagacttcgtgbgcggacagbgbgtgbgcagcgagggctggagbggc 
5 cagaccbgcaacbgcbccaccggcbcbcbgagbgacabbcagcccbgccbgcgggagggcgaggacaagccgbgcbccggc 
cgtggggagtgccagtgcgggcactgtgtgtgctacggcgaaggccgctacgagggtcagttctgcgagtatgacaacttc 
cagbgtccccgcacbtccgggbbcctctgcaabgaccgaggacgctgctccatgggccagbgtgtgtgtgagcctggbtgg 
acaggcccaagcbgtgacbgtccccbcagcaabgccaccbgcabcgacagcaabgggggcabcbgtaatggacgbggccac 
tgtgaghgtggccgctgccactgccaccagcagtcgctctacacggacaccatctgcgagatcaactactcggcgfcccacc 

10 cgggccbcbgcgaggacctacgcbcctgcgbgcagbgccaggcgtggggcaccggcgagaagaaggggcgcacgbgbgagg 
aabgcaacttcaaggtcaagabggbggacgagcbbaagagaggcgaggaggbggbggtgcgctgccccbtccgggacgagg 
abgacgacbgcacctacagctacaccatggaaggbgacggcgcccctgggcccaacagcactgtcctggtgcacaagaaga 
agggactgccctccgggcbcctbctggtggcbcabccccctgcbccbcctccbcctgccgcbcctggccctgctactgctg 
ctabgcbggaagbactgtgcctgctgcaaggcctgcctggcactbcbcccgbgctgcaaccgaggtcacatggtgggcttt 

15 aaggaagaccacbacabgctgcgggagaaccbgabggccbcbgaccacbtggacacgcccabgctgcgcagcgggaacctc 
aagggccgbgacgbggbccgctggaaggbcaccaacaacabgcagcggccbggcbbbgccacbcatgccgccagcabcaac 
cccacagagctggbgcccbacgggcbgbccbbgcgccbggcccgcctbbgcaccgagaaccbgcbgaagccbgacacbcgg 
gagbgcgcccagcbgcgccaggaggbggaggagaaccbgaacgaggbcbacaggcagabcbccggbgfcacacaagcbccag 
cagaccaagbbccggcagcagcccaabgccgggaaaaagcaagaccacaccabbgbggacacagbgctgatggcgccccgc 

20 tcggccaagccggccctgcbgaagcbbacagagaagcaggtggaacagagggccttccacgaccbcaaggbggcccccggc 
tactacaccctcactgcagaccaggacgcccggggcabggtggagbtccaggagggcgbggagctggbggacgtacgggtg 
cccctcbbbabccggccbgaggabgacgacgagaagcagcbgcbggtggaggccabcgacgbgcccgcaggcacbgccacc 
cbcggccgccgccbggbaaacabcaccatcabcaaggagcaagccagagacgbggbgbccbttgagcagccbgagttctcg 
gbcagccgcggggaccaggbggcccgcabcccbgbcabccggcgbgfccctggacggcgggaagbcccaggbctccbaccgc 

25 acacaggabggcaccgcgcagggcaaccgggacbacabccccgbggagggtgagcbgcbgbbccagccbggggaggccbgg 
aaagagcbgcaggbgaagcbccbggagcbgcaagaagbbgacbccctccbgcggggccgccaggbccgccgbtbccacgbc 
cagctcagcaaccctaagbtbggggcccacctgggccagccccacbccaccaccabcatcatcagggacccagatgaactg 
gaccggagcttcacgagtcagabgbtgbcabcacagccacccccbcacggcgacctgggcgccccgcagaaccccaatgcb 
aaggccgcbgggbccaggaagabccabbtcaacbggcbgcccccbbcbggcaagccaabggggbacagggbaaagbactgg 

30 atbcagggbgacbccgaabccgaagcccacctgcbcgacagcaaggtgcccbcagtggagcbcaccaaccbgbacccgbab 
bgcgacbabgagabgaaggtgbgcgccbacggggcbcagggcgagggacccbacagcbcccbggbgbcctgccgcacccac 
caggaagbgcccagcgagccagggcgbcbggccbbcaabgbcgbcbccbccacggbgacccagcbgagcbgggcbgagccg 
gcbgagaccaacggbgagabcacagccbacgaggbctgcbatggcctggbcaacgatgacaaccgaccbabbgggcccatg 
aagaaagtgctggttgacaacccbaagaaccggatgcbgcbtatbgagaacctbcgggagbcccagcccbaccgctacacg 

35 gbgaaggcgcgcaacggggccggctgggggccbgagcgggaggccatcatcaaccbggccacccagcccaagaggcccatg 

bccabccccatcabccctgacatcccbabcgbggacgcccagagcggggaggacbacgacagcbbccbbabgtacagcgab j 
gacgbbcbacgcbctccabcgggcagccagaggcccagcgbcbccgabgacacbggcbgcggcbggaagbbcgagccccbg 
cbgggggaggagcbggaccbgcggcgcgbcacgbggcggcbgcccccggagcbcabcccgcgccbgbcggccagcagcggg 
cgcbccbccgacgccgaggcgccccacgggcccccggacgacggcgccgcgggcgggaagggcggcagccbgccccgcagb 

40 gcgacacccgggccccccggagagcaccbggbgaatggccggatggacbbbgccbtcccgggcagcaccaacbcccbgcac 
aggatgaccacgaccagtgctgcbgccbatggcacccaccbgagcccacacgtgccccaccgcgbgctaagcacatcctcc 
acccbcacacgggacbacaacbcacbgacccgctcagaacacbcacacbcgaccacacbgccgagggactacbccacccbc 
accfcccgbcbcctcccacggcctccctcccabcbgggaacacgggaggagcaggcbbccgctgtcctgggccctggggtcc 
cggagbcgggcbcagabgaaagggbbccccccbbccaggggcccacgagactcbataabccbggctgggaggccagcagcg 

45 cccbcctggggcccagacbcbcgccbgacbgcbggtgbgcccgacacgcccacccgccbggtgbbctcbgcccbggggccc 
acabcbcbcagagfcgagcbggcaggagccgcggbgcgagcggccgctgcagggcbacagbgbggagbaccagcbgcbgaac 
ggcggbgagcbgcabcggcbcaacabccccaaccctgcccagaccbcggbggbggtggaagacctccbgcccaaccacbcc 
bacgtgttccgcgbgcgggcccagagccaggaaggcbggggccgagagcgbgagggtgfccabcaccabbgaabcccaggtg 
cacccgcagagcccactgtgtcccctgccaggcbccgccttcactbbgagcactcccagtgccccaggcccgctggtgtbc 

50 acbgccctgagcccagactcgcfcgcagcbgagcbgggagcggccacggaggcccaabggggababcgbcggcfcaccbggbg 
acctgbgagabggcccaaggaggaggbccagccaccgcabtccgggbggabggagacagccccgagagccggcbgaccgbg 
ccgggccbcagcgagaacgbgcccbacaagbbcaaggfcgcaggccaggaccacbgagggcbbcgggccagagcgcgagggc 
atcabcaccatagagtcccaggabggaggbccctbcccgcagcbgggcagccgbgccgggctcbtccagcacccgctgcaa 
agcgagtacagcagcabcbccaccacccacaccagcgccaccgagcccbbcctagtgggbccgacccbgggggcccagcac 

55 ctggaggcaggcggctccctcacccggcabgtgacccaggagbttgtgagccggacactgaccaccagcggaacccttagc 
acccacabggaccaacagbbcttccaaacttga 

6. SEP ID NO:6 Human bcta4 integrin protein sequence (accfl CAB61345V 

1 magprpspwa rlllaalisv slsgblnrck kapvkscbec vrvdkdcayc bdemfrdrrc 
61 ntgaellaag cqxesiwrae ssfgibeetq idbblrrsqm spqglrvrlr pgeerhfele 
60 121 vfeplespvd lyilmdfsns msddldnlkk mgqnlarvls qlbsdybigf gkfvdkvsvp 
1B1 qbdmrpeklk epwpnsdppf afknvislbe dvdefrnklq gerisgnlda peggfdailg 
241 bavebrdigw rpdsbhllvf sbesafhyea dganvlagim srnderchld bbgbybqyrb 
3 01 qdypsvpblv rllakhniip ifavbnysys yyeklhtyfp vsalgvlqed ssnivellee 

3 61 afnrirsnld iraldsprgl rfcevbskmfq kbrbgsfhir rgevgiygvq Iralehvdgfc 
65 4 21 hvcqlpedqk gnihlkpsfs dglkmdagii cdvcbcelqk evrsarcsfn gdfvcgqcvc 

4 81 segwsgqtcn csbgslsdiq pclregedkp csgrgecqcg hevcygegry egqfceydnf 
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541 qcprbsgflc ndrgrcamgq cvcepgwbgp scdcplsnat cidsnggicn grghcecgrc 
601 hchqqslytd biceinysas trasartyap acsarrgapa rrrgarvrna tsrsrwwbsl 
661 rearrwv/caa psgbrmtbap babpwkvbap Igptalswcb rrrdcppgsf wwliplllll 
721 lpllalllll cwkycaccka clallpccnr ghmvgfkedh ymlrenlmas dhldbpmlrs 
5 781 gnlkgrdwr wkvbnnmqrp gfabhaasin pfcelvpygls lrlarlcten llkpdtreca 
841 qlrqeveenl nevyrqisgv hkiqqbkfrq qpnagkkqdh tivdtvlmap rsakpallkl 
901 bekqveqraf hdlkvapgyy blbadqdarg nvefqegvel vdvrvplEir pedddekqll 
961 veaidvpagt ablgrrlvni biikeqardv vsfeqpefsv srgdgvarip virrvldggk 
1021 S( ^ vs y rt( 5 d 9 baqgnrdyip vegellfqpg eawkelqvkl lelqevdsll rgrqvrrfhv 

10 1081 qlsnpkfgah lgqphsttii irdpdeldrs etsqmlssqp pphgdlgapq npnakaagsr 
1141 kihfnwlpps gkpmgyrvky wiqgdsesea hlldskvpsv eltnlypycd yemkvcayga 
1201 qgegpysslv scrbhqevps epgrlafnw sstvtqlswa epaebngeib ayevcyglvn 
1261 ddnrpigpmk kvlvdnpknr mllienlres qpyrytvkar ngagwgpere aiinlatqpk 
1321 rptnsipiipd ipivdaqsge dydsflmysd dvlrspsgsq rpsvsddtgc gwkfepllge 

15 1381 eldlrrvtwr lppeliprls assgrssdae aphgppddgg aggkggslpr satpgppgeh 
1441 lvngrmdfaf pgstnslhrm bbbsaaaygb hlsphvphrv lstsstltrd ynslfcrsehs 
1501 hsttlprdys tltsvsshgl ppiwehgrsr Iplswalgsr sraqmkgfpp srgprdsiil 
1561 agrpaapswg pdsrltagvp dtpfcrlvfsa lgptslrvsw qeprcerplq gysveyqlln 
1621 ggelhrlnip npaqtsvwe dllpnhsyvf rvraqsqegw greregviti esqvhpqspl 

20 1681 cplpgsaftl stpsapgplv ftalspdslq lswerprrpn gdivgylvbc emaqgggpat 
1741 afrvdgdspe srlbvpglse nvpykfkvqa rbbegfgper egiibiesqd ggpfpqlgsr 
1801 aglfqhplqa eyssisttht satepflvgp tlgaqhleag gsltrhvbqe fvsrblbbsg 
1861 blsbhmdqqf fqb 

7. SEP ID N0:7 Mouse integrin beta4 subunit mRNA (not only cds) (acctf 
25 L04678^: 

ggaccgbcgaggcagcgggactgacccagctgggctcacbgtabbaagaagcggacccgcgacccggagcgcccggggacc 
cgatcbgggagccbggacgggtgcagcgcgcaggaabgcagfcccgccbgacbcaccagcgccfccctbccbacctgcgccgc 
ccgbccabaaagcgcbgcbcgtcccgcccgccgccgccgcccbgcbgbcccgccgggcbcgcccgcgcgctcagctcgacc 
caacgcagcccaagbccgaggtagtctcactaaggaggaggaggabggcagggcccbgbtgcagcccabgggtgaagctgc 

30 tgcbgcbggcacgaatgctgagbgccagccbcccbggagaccbggccaaccgcbgcaagaaggctcaggbgaagagcbgba 
ccgagtgcatccgggbggacaagagctgbgccbacbgcacagacgagcbgbbcaaggagaggcgcbgcaacacccaggcgg 
acgbbcbggcfcgcaggctgcaggggagagagcatcctggbcabggagagcagccbbgaaabcacagagaacacccagabcg 
tcaccagccbgcaccgcagccaggbabcbccccaaggcctgcaagbccggcbgcggcggggbgaggagcgcacgbbbgbgb 
bccaggbcfcbbgagccccbggagagccccgbggabcbcbabatccbcabggacbbcbccaacbccabgbcbgacgabcbgg 

35 acaaccbcaagcagabggggcagaacctggccaagabccbgcgccagcbcaccagcgacbacaccattggabbtggaaagt 
ttgtggacaaagbcagcgtcccacagacagacabgaggcccgagaaacbgaaggagcccbggcccaacagbgatcccccgb 
tcbccttcaagaacgbbatcagctbaaccgagaatgtggaagaattctggaacaaactgcaaggagaacgcabcbcaggca 
accbggacgcbccbgaagggggcbbbgabgccabccbgcagacagcbgbgbgcacaagggacabbggcbggagggcbgaca 
gcacccaccbgcbggbgbtctccaccgagbcbgccbbccacbacgaggctgabggbgccaacgbbctggccggcabcabga 

40 accgcaabgatgagaaabgccaccbggacgccbcgggcgccbacacccaabacaagacacaggacbacccabcagbgccca 
cgcbggbbcgccbgcttgccaagcabaacafccabccccabcbbbgcbgbcaccaacbacbcbbacagctacbabgagaagc 
bccabaagtatttccccgbcfcccbcbctgggcgtccbgcaggaggabbcabccaacatcgbggagcbgcbggaggaggccb 
bctabcgaabbcgcbccaacctggacatccgggcbcbggacagccccagaggcctgagaacagaggtcaccbccgatacbc 
bccagaagacggagactgggtcctbbcacabcaagcggggggaagtgggcacabacaabgtgcabcbccgggcagtggagg 

45 acabagabgggacacatgtgtgccagcbggcbaaagaagaccaaggcggcaacabccacctgaaacccbccbbctctgabg 
gccbccggabggacgcgagtgbgabcbgbgacgtgbgcccctgtgaccbgcaaaaggaagtbcgabcagcbcgcbgtcacb 
bcagaggagacbbcabgbgbggacacbgtgbgbgcaatgagggctggagbggcaaaaccbgcaacbgcbccaccggcbcbc 
bgagbgacacacagcccbgccbgcgtgagggbgaggacaaaccgtgcbcgggccacggcgagbgccagbgcggacgcbgbg 
bgbgcbabggbgaaggccgctacgagggbcacbbcbgcgagbabgacaacbtccagbgtccccggaccbctggabbccbgb 

50 gcaatgaccggggacgcbgttcbabgggagagbgtgbgbgbgagccbggbbggacaggccgcagcbgcgacbgbcccctca 
gcaatgccacctgcabcgafcagcaacgggggcabcbgcaacggccgaggcbactgtgagtgbggccgtbgbcactgcaacc 
agcagbcgcbcbacacggacaccaccbgfcgagabcaacbacbcbgcgabacbgggbcfccbgbgaggabcbccgcbccbgcg 
bacagbgccaggcctggggcaccggggagaagaaagggcgcgcgbgbgacgabbgccccbbbaaagbcaagabggbagacg 
agcbfcaagaaagaagaggtggtggagbacbgcbccbbccgggabgaggabgacgacbgcacbbacagcbacaacgbggagg 

55 gcgacggcagcccfcgggcccaacagcacagbccbggbccacaaaaagaaagacbgccbcccggcbccbbccbggbggcbca 
bcccccbgcbcabctbccfcccbgbbgcbccbggcgbbgcbbcbgcbgcbcbgcbggaaabactgtgccbgcbgcaaagccb 
gccbggggcbtcbtccbbgctgcaaccgaggbcacabggtgggctttaaggaagatcactatacgcttcgggagaacctga 
bggccbcbgaccacctggacacgcccabgcbacgaagcgggaaccbcaagggacgagacacagbccgcbggaagabcacca 
acaatgbgcagcgcccbggcbbtgccacccatgccgccagcaccagccccacggagcbcgbaccctacgggcbgbcccbgc 

60 gccttggccgccbcbgcacbgagaaccbbabgaagccgggcacccgagagtgbgaccagcbacgccaggaggbggaggaaa 
abcbgaabgaggtgbabagacaggbcagcggcgcacacaagcbccagcagacgaagbbccgacagcagcccaacgccggga 
aaaagcaagaccacaccabbgbggacacagbgbbgctggcgccccgcbccgccaagcagabgcbgcbgaagcbgacagaga 
agcaggbggagcaggggbccfcbccabgaactgaaggbggccccbggctacbacactgtcacggcagagcaggatgcccggg 
gcatggtggagbtccaggagggcgbggagccggtggacgbgcgagbgcccctcbtcatccggccbgaggatgatgabgaga 

65 agcagcbgcbggtggaggccabbgabgtccctgbgagcacbgccaccctbggbcgccgtctggbaaacatcaccabbabca 
aggaacaagctagtggggtagbgtccbbcgagcagcctgaatacbcggbgagbcgbggagaccaggbggcccgcabcccbg 
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tcabccggcacabcctggacaabggcaagbcccaggbcbccbabagcacacaggabaabacagcacacggacaccgggabt 
atgbbcccgbggagggagagcbgcbgbbccabccbggggaaaccbggaaggagbbgcaggbgaagctactggagccgcacg 
aggtfcgactccctcctgcgtggccgccaggtccgccgcttccaagtccaactcagcaaccccaagttcggagcccgcctgg 
gccagcccagcacaaccaccgttattctcgatgaaacggacaggagtctcataaatcaaacactttcabcgcctccgccac 
5 cccatggagacctgggcgccccacagaaccccaatgccaaggctgccggatccaggaagatccatttcaactggctgcccc 
ctcctggcaagccaatggggtacagggtgaagtaccggatccagggcgactctgaatctgaagcccaccttctagatagca 
aggbgcccbcagbggaacbcaccaaccbgbabcccbabbgcgacbacgaaabgaaggbgbgbgccbabggggccaagggtg 
aggggccctabagcbcacbggbgbccbgccgcacccaccaggaagbacccagtgagccagggacgcbggcttbcaabgbag 
tcbcbbcbacggbgacbcagcbgagcbgggcagagccagcfcgagaccaatggcgagatcacagcctacgaggbctgctatg 

10 gactggbcaabgaggacaacagacccabtggacctatgaagaaggtgctcgtggacaaccccaagaaccggatgctgctca 
ttgagaabcbgcgagatbcccagccataccgabacacggbbaaggcgcgcaatggggcaggabggggacccgagagagagg 
ctatcatcaacctggctacacagcccaagcggcccabgtccatcccbabcabcccagacabccccabagbggacgcccagg 
gtggagaagacbacgaaaactbccbbabgbacagbgabgacgbccbgcggbccccagccagcagccagaggcccagcgbbb 
cbgabgacactgagcacctggbgaabggccggatggacbbbgccbabccaggcagcgccaacbcccbgcacagaabgacbg 

15 cagccaabgbggccbabggcacgcabctgagcccacaccbgbcccaccgagbgcbgagcacgtccbccaccctbacccggg 
acbaccactcbcbgacacgcacagagcacbcccacbcaggcacacbbcccagggactactccaccctcacttccctttcct 
cccaagcctcccbccbatcbgggaagabgggaggagcaggcbbccgcbgbcctggacbcbbgggbccttgagccgggcbca 
catgaagggtgtgcccgcatccaggggbbcaccagactctataabcctggccgggcagtcagcagcaccctccbggggbac 
aggattcccgbggggctgbgggtgtgccbgacacacccactcggcbggbgbfccbcbgcccbggggcgcacgbcbbfcgaagg 

20 tgagctggcaggagccacagbgbgabcggacgcbgctgggcbacagbgtggaabaccagctacbaacgbgcgbcgagabgc 
abcggcbcaacatcccbaacccbggccaaaccbcggbggbggbagaggabctccbgccbaactacbctcabgbgtbccggg 
bacgggcacagagccaggagggcbggggtcgagagcgagagggbgbcabcaccabcgagbcccaggbgcacccgcagagcc 
cbcbcbgcccccbgccaggcbcagccbbcacbctgagcacccccagcgccccaggaccactggbgbtcacbgccctaagcc 
cagactccccgcagctcagctgggagcggccgaggagccgcaatggagatabcctbggctacctggtgacctgtgagatgg 

25 cccaaggaggagcaccagccaggacctbccgggbggacggagacaaccctgagagccggbbgacbgbaccbggccbcagtg 
agaacgbbccbbacaagttcaaggttcaggccaggacgaccgagggcttbgggccagagcgbgagggtatcabcaccabcg 
agtctcaggttggaggccccbbcccacagcbgggcagcaabbcbgggcbcbbccagaacccagbgcaaagcgagbbcagca 
gcgbgaccagcacgcacagcaccacgacbgagcccbbccbcatggabggtcbaaccctggggacccagcgcctggaagcag 
gaggcbcccbcacccggcabgbgacccaggaabbcgbgacccggaccbbaacggccagbggcbcbcbcagcacbcababgg 

30 accaacagbtctbccaaacctgaaccbcccccgcgccccagccaccbgggccccbcctbgcctccbcbccbagcgccttct 
bccbcbgcbgcbcbacccacgagcbtgctgaccacagagccagccccbgbagbcagagagcaggggbaggtgcbgbccagg 
aaccataaagbgggbagaggtgatacaaggtctttctgactgcatcccaccctgggbccaabcccacabgtaacc 

8. SEP ID NO:8 Murine integrin beta4 protein sequence: 

MAGPCCSPWV KLLLLiARMLS ASLPGDLANR CKKAQVKSCT ECIRVDKSCA YCTDELFKERRCNTQADVLA 
35 AGCRGESILV MESSLEITEN TQIVTSLHRS QVSPQGLQVR LRRGEERTFVFQVFEPLESP VDLYILMDFS 
NSMSDDLDNL KQMGQNLAKI LRQIjTSDYTI G FGKFVDKVS VPQTDMRP EK LKEPWPNSDP PFSFKNVISL 
TENVEEFWNK LQGERISGNIi DAPEGGFDAILQTAVCTRDI GWRADSTHLL VFSTESAFHY EADGANVLAG 
IMNRNDEKCH LDASGAYTQY 

KTQDYPSVPT LVRLIoAKHNI I.P,IFAVTNYS YSYYEKLHKY FPVSSLGVLQ EDSSNIVELLEEAFYRIRSN 
40 LDIRALDSPR GLRTEVTSDT LQKTETGSFH I KRGEVGTYN VHLRAVED I DGTHVCQLAKE DQGGNIHLKP 
SFSDGLRMDA SVICDVCPCE LQKEVRSARC HFRGD FMCGHCVCNEGWSGK TCNCSTGSLS DTQPCLREGE 
DKPCSGHGEC QCGRCVCYGE GRYEGHFCEYDNFQCPRTSG FLCNDRGRCS MGECVCEPGW TGRSCDCPLS 
NATCIDSNGG I CNGRG YCEC 

GRCHCNQQSL YTDTTCEINY SAILGLCEDL RSCVQCQAWG TGEKKGRACD DCPFKVKMVDELKKEEWEY 
45 CSFRDEDDDC TYSYNVEGDG SPGPNSTVLV HKKKDCLPAP SWWLIPLLIFLLLLIiAIiLLL LCWKYCACCK 
ACLGLLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLRSGNLKGRDTV RWKITNNVQR PGFATHAAST 
SPTELVPYGIi SLRLGRLCTE NLMKPGTRECDQLRQEVEEN LNEVYRQVSG AHKLQQTKFR QQPNAGKKQD 
HTIVDTVLLA PRSAKQMIiLK 

LTEKQVEQGS FHELKVAPGY YTVTAEQDAR GMVEFQEGVE LVDVRVP1/F1 RPEDDDEKQLLVEAIDVPVS 
50 TATIiGRRbVN ITIIKEQASG WSFEQPEYS VSRGDQVARI PVI RHI LDNGKSQVSYSTQD NTAHGHRDYV 
PVEGELLFHP GETWKELQVK LLELQEVDSL LRGRQVRR FQVQLSN P KFGA RLGQPSTTTV ILDETDRSLI 
NQTLSSPPPP HGDLGAPQNP NAKAAGSRKI H FNWLP P PGK PMGYRVKYWI QGDSESEAHb liDSKVPSVEL 
TNLYPYCDYE MKVCAYGAKG 

EGPYSSLVSC RTHQEVPSEP GRLAFNWSS TVTQLSWAEP AETNGEITAY EVCYGLVNEDNRPIGPMKKV 
55 LVDNPKNRMEi LIENtiRDSQP YRYTVKARNG AGWGPEREAI INLATQPKRPMSIPIIPDIP IVDAQGGEDY 
ENFLMYSDDV LRSPASSQRP SVSDDTEHLV NGRMDFAYPGSANSltHRMTA ANVAYGTHLS PHLSHRVL3T 
SSTLTRDYHS LTRTEHSHSG TLPRDYSTLTSLSSQASLLS GKMGGAGFRC PGbliGP 

9. SEP ID NO:9 Mouse beta4 dominant negative cds: 

afcggcagggcccbgbtgcagcccabgggbgaagcbgcfcgcbgctggcacgaabgcbgagbgccagccbcccbggagaccbg 
60 gccaaccgcbgcaagaaggctcaggbgaagagctgbaccgagbgcatccgggbggacaagagcbgbgccbacbgcacagac 
gagcbgtbcaaggagaggcgcbgcaacacccaggcggacgbbcbggcbgcaggcbgcaggggagagagcabcctggbcatg 
gagagcagccbbgaaatcacagagaacacccagabcgbcaccagccbgcaccgcagccaggbabctccccaaggcctgcaa 
gbccggcbgcggcggggbgaggagcgcacgtbtgbgbtccaggbcttbgagccccbggagagccccgtggabctgbatabc 
cbcabggactfcctccaacbccatgbcbgacgatcbggacaacctcaagcagabggggcagaaccbggccaagabccbgcgc 
65 cagcbcaccagcgactacaccatfcggabbbggaaagttbgfcggacaaagtcagcgbcccacagacagacabgaggcccgag 
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aaactgaaggagcccbggcccaacagtgabcccccgbtctccbbcaagaacgttatcagcttaacggagaatgtggaagaa 
btcbggaacaaacbgcaaggagaacgcabctcaggcaaccbggacgcbccbgaagggggcbbtgatgccabccbgcagaca 
gctgbgbgcacaagggacatbggcfcggagggctgacagcacccacctgcbggtgbfcctccaccgagbcbgccttccactac 
gaggctgabggbgccaacgbbctggccggcabcabgaaccgcaabgabgagaaabgccaccbggacgccbcgggcgcctac 
5 acccaatacaagacacaggactacccafccagtgcccaccctggttcgcctgcttgccaagcataacabcatccccatcttb 
gctgtcaccaactactcttacagctactatgagaagctccataagtatttccccgtctcctctctgggcgtccfcgcaggag 
gattcabccaacabcgtggagctgctggacgaggccbtcbatcgaatbcgctccaacctggacabccgggctcbggacagc 
cccagaggccbgagaacagaggtcacctccgabacbcbccagaagacggagactgggbccttbcacatcaagcggggggaa 
gbgggcacabacaabgbgcabcbccgggcagbggaggacabagabgggacacabgbgbgccagcbggcbaaagaagaccaa 

10 gggggcaacatccacctgaaacccfcccttctctgatggcctccggatggacgcgagtgbgatctgbgacghgtgcccctgt 
gagcbgcaaaaggaagbbcgatcagctcgcbgbcacbbcagaggagacbbcabgbgbggacacbgfcgbgbgcaabgagggc 
tggagtggcaaaacctgcaactgctccaccggctcbcbgagtgacacacagcccbgccbgcgbgagggbgaggacaaaccg 
tgctcgggccacggcgagtgccagbgcggacgctgtgbgtgctafcggbgaaggccgctacgagggtcactbctgcgagtat 
gacaactbccagtgbccccggacctctggabbcctgbgcaatgaccggggacgcbgtbcbabgggagagtgbgtgtgtgag 

1 5 ccbggbbggacaggccgcagcbgcgactgbccccbcagcaabgccaccbgcabcgabagcaacgggggcatcbgcaacggc 
cgaggcbacbgbgagbgtggccgtbgbcacbgcaaccagcagbcgctctacacggacaccaccbgtgagabcaacbactct 
gcgabacbgggbctcfcgtgaggabcbccgcbccbgcgbacagbgccaggccbggggcaccggggagaagaaagggcgcgcg 
tgbgacgattgccccbbbaaagbcaagabggbagacgagcbbaagaaagaagaggbggbggagbacbgcbccbtccgggab 
gaggabgacgactgcacttacagcbacaacgbggagggcgacggcagcccfcgggcccaacagcacagbccbggtccacaaa 

20 aagaaagactgcctcccggctcctbcctggbggctcatccccctgcbcatcttcctcctgbtgcbcctggcgttgcbtctg 
cbgcbctgcbggaaabga 

10. SEP IP NO:10 Mouse beta4 dominant negative protein sequence: 

MAGPCCSPWV KLLLLARMLS ASLPGDLANR CKKAQVKSCT ECIRVDKSCA YCTD ELF KERR CNTQADVLA 
AGCRGESILV MESSLBITEN TQIVTSLHRS QVSPQGLQVR LRJRGEERTFVFQVFEPLES P VDLYILMDFS 
25 NSMSDDLDNL KQMGQNLAKI LRQLTSDYTI GFGKFVDKVSVPQTDMRPEK LKEPWPNSDP PFSFKNVISL 
TENVEEFWNK LQGERISGNL DAPEGGFDAILQTAVCTRDI GWRADSTHLL VFSTESAFHY EADGANVLAG 
IMNRNDEKCH LDASGAYTQY 

KTQDYPSVPT I;VRJjLAKHNI IPIFAVTNYS YSYYEKLHKY FPVSSIjGVLQ EDSSNIVELLEEAFYRIRSN 
I/DIRALDSPR GLRTEVTSDT LQKTETGSFH IKRGEVGTYN VHLRAVEDI DGTHVCQLAKE DQGGNIHLKP 
30 S FSDGLRMD A SVICDVCPCE LQKEVRSARC H FRGD FMCGHCVCNEG WSG K TCNCSTGSLS DTQPCLREGE 
DKPCSGHGEC QCGRCVCYGE GRYEGHFCEYDNFQCPRTSG FLCNDRGRCS MGECVCEPGW TGRSCDCPLS 
NATCIDSNGG ICNGRGYCEC 

GRCHCNQQSIi YTDTTCEINY SAILGLCEDL RSCVQCQAWG TGEKXGRACB DCPFKVKMVDELKKEEWEY 
CSFRDEDDDC TYSYNVEGDG SPGPNSTVLV HKKKDCLPAP S WWLI PLLI FLLLLLALLiLLiIjCWK 

35 11. SEP ID NO:ll Murine laminin gamma2 chain complete cds (accfl 

U43327V. 

atgcctgcgctctggctcagctgctgcctcggtgtcgcgctcctgctgcccgccagccaggccacctccagga 
gggaagtcbgtgattgcaatgggaagtccaggcaabgtgbctttgatcaggagctccatcgacaagcaggcag 
cgggbtccgtbgcctcaacbgcaabgacaabacagcgggggttcacbgcgagcggtcgagggaggggttttac 

40 cagcatcagagcaagagccgcbgccbacccbgcaacbgccacbcaaagggbtccctcagtgcbggabgbgaca 
actctggacaatgcaggtgtaagccaggtgtgacaggacaaagatgtgaccagtgtcagccaggcttccatat 
gctcaccgatgctggatgcacccgagaccaggggcaactagattccaagtgtgactgtgacccagctggcatc 
tctggaccctgtgabbctggccgabgtgbcbgcaaaccagccgbcacbggagagcgctgbgabaggbgccgac 
cacgtgactatcatctggaccgggcaaaccctgagggctgtacccagtgtttctgctabgggcattcagccag 

45 ctgccacgcctctgccgacttcagtgtccacaaaatcacttcaactttcagtcaggatgtggatggttggaag 
gcggbbcagagaaacggggcaccbgcaaaactccacbggtcacagcgccabcgggacgbgbbbagbbcbgccc 
gaagatcagaccccgtctatttcgtggcccctgccaaattcctcggtaaccagcaagtgagttacgggcagag 
cctgtcttttgactaccgcgtggacagaggaggtagacagccgtctgcctacgatgtgatcctggaaggtgct 
ggtctacagatcagagctcctctgatggctccaggcaagacactbccttgtgggatcacaaagacttacacat 

50 tcagactgaatgaacatccaagcagbcactggagtccccagctgagttatttcgaatabcgaaggttactgcg 
gaacctcacagccctccbgabgabccgagctacgtacggagaabatagbacagggtacatbgataacgtgacc 
ctggtttcagcccgccctgtccttggagccccagccccttgggttgaacgttgtgtatgccbgctggggtaca 
agggacaabbcbgccaggaatgbgcbbctggbbacaaaagagatbcggcaagattgggcgcbttbggcgcctg 
tgttccctgtaactgccaaggggagggggcctgtgatccagacacgggagattgctactcgggggacgagaab 

55 cctgacattgagtgbgctgactgtcccatcggbbtcbacaatgacccacabgacccccgcagcbgcaagccab 
gbcccbgbcacaabgggbbcagcbgtbcagtgabgccfcgagacagaggaggbggbgbgtaacaactgbccccc 
tggggtcacaggtgcccgcbgtgagctcbgbgctgatggcbtcbbbggggabcccbbbggggaacatggccca 
gtgaggccbbgtcaacgcbgccaabgcaacaacaacgbggaccccaabgcctctgggaactgbgaccagttga 
caggcagabgcbtgaaabgbabcbacaacacggccggbgbcbacbgtgaccagtgcaaagcaggtbacbttgg 

60 agacccatbggctcccaacccagcagacaagtgtcgagcttgcaactgcagccccabgggbgcggagcctgga 
gagtgtcgaggbgabggcagctgbgbbtgcaagccaggctbbggcgccbtcaacbgbgabcacgcagcccbaa 
ccagbtgbcctgcttgctacaatcaagtgaagattcagabggaccagtbbacccagcagctccagagcctgga 
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ggccctggtttcaaaggctcagggtggtggtggtggtggbacagtcccagtgcagctggaaggcaggatcgag 
caggctgagcaggcccttcaggacattctgggagaagctcagatttcagaaggggcaabgagagccgttgctg 
tccggctggccaaggcaaggagccaagagaacgactacaagacccgcctggatgacctcaagatgactgcaga 
aaggatccgggccctgggcagtcagcatcagaacagagttcaggatacgagcagactcatctctcagahgcgc 
5 ctgagtctggcaggaagcgaagctctcttggaaaacactaatatccattcttctgagcactacgtggggccga 
atgattttaaaagtctggctcaggaggctacaagaaaggcagacagccacgctgagtcagctaacgcaatgaa 
gcaactagcaagggaaactgaggactactccaaacaagcactttcattggcccgcaagctcttgagtggagga 
ggcggaagtggctctbgggacagctccgtggbacaaggtcbtatgggaaaattagagaaaaccaagbccctga 
gccagcagctgtcattggagggcacccaagccgacattgaagctgataggtcgtatcagcacagtctccgcct 

10 cctggattctgcctctcagcttcagggagtcagtgatctgtcctttcaggtggaagcaaagaggatcagacaa 
aaggctgattctctctcaaacctggtgaccagacaaacggabgcabbcacgcgtgtgcgaaacaatctgggga 
actgggaaaaagaaacacggcagcttttacagactggaaaggataggagacagacttcagatcagctgctttc 
ccgtgccaacctbgctaaaaacagagcccaagaagcgcbaagtatgggcaatgccactttttabgaagtbgag 
aacatcctgaagaacctccgagagtttgatchgcaggttgaagacagaaaagcagaggctgaagaggccatga 

15 agagactctcctctattagccagaaggttgcggatgccagtgacaagacccagcaagcagaaacggccctggg 
gagcgccactgccgacacccaacgggcaaagaacgcagctagggaggccctggagatcagcagcgagabagag 
ctggagatagggagtctgaacttggaagctaatgtgacagcagahggggccttggccatggagaaagggactg 
ccactctgaagagcgagatgagagagatgattgagctggccagaaaggagctggagtttgacacggataagga 
cacggtgcagctggtgattactgaagcccagcaagctgatgccagagccacgagtgccggagttaccatccaa 

20 gacacrctcaacacattggacggcatcctacacctcatagaccagcctggcagtgtggatgaagaagggatga 
tgctattagaacaagggcttttccaagccaagacccagatcaacagtcgacttcggcccttgatgtctgacct 
ggaggagagggtgcgtcggcagaggaaccacctccatctgctggagactagcatagatggaattcttgctgat 
gtgaagaacctggagaacattcgagacaacctgcccccaggctgctacaatacccaagctcttgagcaacagt 
ga 

25 12, SEP IT> NO:12 Murine laminin gamma2 chain protein sequence (acc# 

AAA85256> " 

1 mpalwlsccl gvalllpasq atsrrevcdc ngksrqcvfd qelhrqagsg frclncndnt 
61 agvhcersre gfyqhqsksr clpcnchskg slsagcdnsg qcrckpgvtg qrcdqcqpgf 
121 hmltdagctr dqgqldskcd cdpagiagpc dsgrcvckpa vtgercdrcr prdyhldran 

30 181 pegcbqcfcy ghsaschasa dfsvhkibab fsqdvdgwka vqrngapakl hwsqrhrdvf 
241 ssarrsdpvy fvapakflgn qqvsygqsls fdyrvdrggr qpsaydvile gaglqirapl 
301 mapgktlpcg itktytfrln ehpsshwspq Isyfeyrrll rnltallmir atygeysfcgy 
361 idnvtlvsar pvlgapapwv ercvcllgyk gqfcqecasg ykrdsarlga fgacvpcncq 
421 gegacdpdtg dcysgdenpd iecadcpigf yndphdprsc kpcpchngfs csvmpeteev 

35 481 vcnncppgvt garcelcadg ffgdpfgehg pvrpcqrcqc nnnvdpnasg ncdqltgrcl 
541 kciyntagvy cdqckagyf3 dplapnpadk cracncspmg aepgecrgdg scvckpgfga 
601 fncdhaalbs cpacynqvici qmdqftqqlq slealvakaq ggggggtvpv qlegrieqae 
661 qalqdilgea qisegamrav avrlakarsq endyktrldd lkmtaerira Igsqhqnrvq 
721 dtsrlisqmr lslagseall entnihsseh yvgpndfksl aqeatrkada haesanamkq 

40 781 laretedysk qalslarkll sggggsgswd sswqglmgk lektkslsqq Islegtqadi 
841 eadrsyqhsl rlldsasqlq gvsdlsfqve akrirqkads lsnlvtrqtd aftrvrnnlg 
SOI nweketrqll qbgkdrrqbs dqllsranla knraqealsm gnatfyeven ilknlrefdl 
961 qvedrkaeae eamkrlssie qkvadaBdkt qqaetalgsa tadbqrakna arealeisse 
1021 ieleigslnl eanvtadgal amekgbablk semremiela rkelefdtdk dtvqlvitea 

45 1081 qqadarabsa gvtiqdtlnb ldgilhlidq pgsvdeegmm lleqglfqak tqinsrlrpl 
1141 msdleervrr qrnhlhllet sidgiladvk nlenirdnlp pgcyntqale qq 

13. SEP IP NO: 13 Human laminin gamma2 chain complete cds (acc# 
AH006634^ 

atgcctgcgctctggctgggctgctgcctctgcfctctcgctcctcctgcccgcagcccgggccacctccaggagggaagtc 
50 bgbgatbgcaabgggaagbccaggcagbgtabcbtbgabcgggaacttcacagacaaacbggfcaabggabbccgctgcctc 
aacbgcaabgacaacactgabggcabtcacbgcgagaagbgcaagaatggcttbbaccggcacagagaaagggaccgctgb 
ttgccctgcaattgtaactccaaaggttctctfcagtgctcgatgtgacaactctggacggtgcagctgtaaaccaggtgtg 
acaggagccagatgcgaccgatgtctgccaggctfcccacatgctcacggafcgcggggtgcacccaagaccagagactgcta 
gactccaagtgtgactgtgacccagctggcatcgcagggccctgtgacgcgggccgctgtgtctgcaagccagctgttact 
55 ggagaacgctgbgataggbgbcgabcaggbbacbabaabcbggatggggggaacccbgagggcbgbacccagbgtbbctgc 
fcatgggcatbcagccagctgccgcagcbctgcagaabacagtgtccabaagabcaccbcbacctbbcabcaagatgttgab 
ggcbggaaggctgbccaacgaaabgggbcbccbgcaaagcbccaabggbcacagcgccatcaagabgbgbbbagcbcagcc 
caacgactagaccctgtcbabtttgbggcbccfcgccaaabbbcttgggaatcaacaggtgagctabgggcaaagcctgtcc 
fcttgactaccgbgbggacagaggaggcagacacccabcfegcccabgabgbgatbccggaaggtgcbggtcbacggatcaca 
60 gctcccbtgabgccacbfcggcaagacacbgcctbgbgggcbcaccaagacfctacacattcaggttaaatgagcatccaagc 
aabaatbggagcccccagctgagbtacttbgagtabcgaaggbbactgcggaabcbcacagccctccgcatccgagcbaca 
tabggagaabacagtactgggbacabtgacaabgbgacccbgabbbcagcccgcccbgbctcbggagccccagcaccctgg 
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gtbgaacagbgtabatgtcctgbbgggtacaaggggcaatbctgccaggattgtgcfctcbggctacaagagagabbcagcg 
agactggggcctbttggcaccbgbabtccbtgbaacbgtcaagggggaggggccbgbgatccagacacaggagattgttab 
tcaggggatgagaatcctgacabtgagtgtgctgactgcccaattggtttctacaacgatccgcacgacccccgcagctgc 
aagccatgtcccbgtcataacgggttcacctgcbcagbgabgccggagacggaggaggbggbgtgcaataacbgcccbccc 
5 ggggtcaccggbgcccgcbgbgagctcbgtgctgatggctacbbbggggaccccbbtggbgaacatggcccagbgaggccb 
tgtcagccctgtcaatgcaacaacaatgtggaccccagbgcccctgggaattgtgaccggctgacaggcaggtgtttgaag 
tgtatccacaacacagccggcatctactgcgaccagtgcaaagcaggctacttcggggacccatcggctcccaacccagca 
gacaagtgbcgagcttgcaactgtaaccccatgggctcagagcctgtaggatgtcgaagtgatggcacctgtgttbgcaag 
ccaggabtbggfcggccccaacbgtgagcatggagcatbcagctgbccagctbgcbataabcaagtgaagabbcagabggab 

10 cagttbatgcagcagcbtcagagaatggaggccctgabtbcaaaggcbcagggbggbgabggagtagbacctgabacagag 
ctggaaggcaggatgcagcaggcbgagcaggcccttcaggacafctctgagagatgcccagatttcagaaggbgctagcaga 
tcccttggbctccagttggccaaggbgaggagccaagagaacagcbaccagagccgcctggatgaccbcaagatgacbgtg 
gaaagagbbcgggcbcbgggaagbcagbaccagaaccgagbbcgggabacbcacaggctcatcactcagabgcagctgagc 
ctggcagaaagbgaagcttccbbgggaaacactaacattcctgccbcagaccacbacgtggggccaaabggcbtbaaaagt 

1 5 cbggcbcaggaggccacaagabbagcagaaagccacgbtgagtcagccagtaacabggagcaactgacaagggaaacbgag 
gacbabbccaaacaagcccbcbcacbggbgcgcaaggccctgcabgaaggagbcggaagcggaagcggbagcccggacggb 
gcfcgtggbgcaagggcbtgbggaaaaatbggagaaaaccaagtcccbggcccagcagbfcgacaagggaggccacbcaa'gcg 
gaaabbgaagcagabaggfccbbatcagcacagtcbccgccbccbggatbcagbgbcbcggcbbcagggagbcagbgabcag 
tccbtbcaggtggaagaagcaaagaggabcaaacaaaaagcggabbcactcbcaacgcbggtaaccaggcatabggatgag 

20 ttcaagcgbacacaaaagaatctgggaaactggaaagaagaagcacagcagctcttacagaabggaaaaagtgggagagag 
aaabcagabcagctgctttcccgtgccaatcbbgctaaaagcagagcacaagaagcactgagtatgggcaabgccactbtb 
babgaagbbgagagcabccttaaaaaccbcagagagbbbgacctgcaggbggacaacagaaaagcagaagcbgaagaagcc 
abgaagagactcbccbacatcagccagaaggbtbcagabgccagbgacaagacccagcaagcagaaagagcccbggggagc 
gcbgctgcbgabgcacagagggcaaagaabggggccggggaggccctggaaabcbccagtgagattgaacaggagatbggg 

25 agbcbgaacbtggaagccaabgtgacagcagatggagccttggccabggaaaagggacbggccbctcbgaagagbgagatg 
agggaagbggaaggagagcbggaaaggaaggagctggagbbbgacacgaatabggabgcagtacagatggtgatbacagaa 
gcccagaaggtbgataccagagccaagaacgctggggbtacaatccaagacacacbcaacacatcagacggcctcctgcat 
ctgatggaccagccbcbcagbgbagabgaagaggggcbggtcbtacbggagcagaagcfcbbcccgagccaagacccagabc 
aacagccaactgcggcccabgabgbcagagctggaagagagggcacgbcagcagaggggccaccbccatbtgcbggagaca 

30 agcabagabgggabbcfcggcbgabgbgaagaacbtggagaacabbagggacaaccbgcccccaggcbgcbacaabacccag 
gctcbfcgagcaacagtga 

14. SEP ID NO:14 Human lamlnin gamma2 chain protein sequence facc# 
AAC50457^ Alternative splice form 

1 mpalwlgccl cfslllpaar atsrrevcdc ngksrqcifd relhrqbgng frclncndnt 
35 61 dgihcekckn gfyrhrerdr clpcncnskg slsarcdnsg rcsckpgvbg arcdrclpgf 
121 hmlbdagcbq dqrlldskcd cdpagiagpc dagrcvckpa vtgercdrcr sgyynldggn 
181 pegcbqcfcy ghsascrssa eyavhkibat fhqdvdgwka vqmgspakl qwsqrhqdvf 
241 ssaqrldpvy fvapakflgn qgvsygqsls fdyrvdrggr hpsahdvile gaglritapl 

3 01 mplgkblpcg ltktybfrln ehpsnnwspq Isyfeyrrll rnltalrira bygeystgyi 
40 3 SI dnvblisarp vsgapapwve qcicpvgykg qfcqdcasgy krdsarlgpf gtcipcncqg 

421 ggacdpdbgd cysgdenpdi ecadcpigfy ndphdprsck pcpchngf sc svmpeteew 

4 81 cnncppgvbg arcelcadgy fgdpfgehgp vrpcqpcqcn nnvdpsasgn cdrltgrclk 
541 cihnbagiyc dqckagyfgd plapnpadkc racncnpmga epvgcrsdgt cvckpgfggp 
601 ncehgafscp acynqvkiqm dqfmqqlqrm ealiskaqgg dgwpdbele grmqqaeqal 

45 661 qdilrdaqis egasrslglq lakvrsqens yqsrlddlkm bvervralgs qyqnrvrdbh 
721 rlibqmqlsl aeseaslgnfc nipasdhyvg pngfkslaqe atrlaeshve sasnmeqltr 
781 etedyakqal slvrkalheg vgsgsgspdg awqglvekl ektkslaqql treatqaeie 
841 adrsyqhslr lldsvsrlqg vsdqsfqvee akrikqkads lsfclvbrhmd efkrtqknlg 
901 nwkeeaqqll qngksgreks dqllsranla ksraqealsm gnabfyeves ilknlrefdl 

50 961 qvdnrkaeae eamkrlsyis qkvsdasdkt qqaeralgsa aadaqrakng agealeisse 
1021 ieqeigslnl eanvtadgal amekglaslk semrevegel erkelefdbn mdavqmvibe 
1081 aqkvdtrakn agvtiqdbln bldgllhlmd qplsvdeegl vlleqklsra kbqinsqlrp 
1141 mmseleerar qqrghlhlle bsidgiladv knlenirdnl ppgcynbqal eqq 

15. SEP ID NO:15 Human {aminjn gamma2 chain protein sequence (acc# 
55 AAC50456) Alternative splice form 

1 mpalwlgccl cfslllpaar afc6rrevcdc ngksrqcifd relhrqbgng frclncndnb 
61 dgihcekckn gfyrhrerdr clpcncnskg slsarcdnsg rcsckpgvbg arcdrclpgf 
121 hmlbdagcbq dqrlldskcd cdpagiagpc dagrcvckpa vbgercdrcr sgyynldggn 
131 pegctqcfcy ghsascrssa eysvhkitst fhqdvdgwka vqmgspakl qv/sqrhqdvf 

60 241 ssaqrldpvy fvapakflgn qqvsygqsls fdyrvdrggr hpsahdvile gaglribapl 
301 mplgkblpcg lbkbytfrln ehpsnnwspq Isyfeyrrll rnlbalrira bygeystgyi 
361 dnvblisarp vsgapapwve qcicpvgykg qfcqdcasgy krdsarlgpf gtcipcncqg 
421 ggacdpdbgd cysgdenpdi ecadcpigfy ndphdprsck pcpchngf sc svmpebeew 
4 81 cnncppgvbg arcelcadgy fgdpfgehgp vrpcqpcqcn nnvdpsasgn cdrlbgrclk 

65 541 cihntagiyc dqckagyfgd plapnpadkc racncnpmgs epvgcrsdgt cvckpgfggp 
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601 ncehgafscp acynqvJciqm dqfmqqlqrm ealiskaqgg dgvvpdtele grmqqaeqal 
661 qdilrdaqis egasrslglq lakvrsqens yqsrlddlkm tvervralgs qyqnrvrdth 
721 rlifcqmqlsl aeseaslgnt nipasdhyvg pngfkslage atrlaeshve sasnmeqltr 
781 etedyskqal slvrkalheg vgsgsgspdg awqgivekl ektkslaqql treatqaeie 
5 841 adrsyqhslr lldsvsrlqg vsdqsfqvee akrikqkads lstlvtrhmd efkrtqknlg 
901 nwkeeaqqll qngksgreks dqllsranla ksraqealsm gnatfyeves ilknlrefdl 
961 qvdnrkaeae eamkrlsyis qkvsdasdkt qqaeralgsa aadaqrakng agealeisse 
1021 ieqeigslnl eanvtadgal amekglaslk semrevegel erkelefdtn mdavqmvite 
1081 aqkvdtrakn agvtiqdfcln tldgllhlmg m 

10 16. Degenerate Human intcgrin beta4 subunit cds 

atg gca gCg cca cgc ccc age cca fcgg gec agg ctg etc ctg gca gec ttg ate age gtc 
age etc tct ggg ace btg aac cgc tgc aag aag gec cca gtg aag age tgc acg gag tgt 
gtc cgt gtg gat aag gac tgc gec tac tgc aca gac gag atg ttc agg 

gaccggcgctgcaacacccaggcggagctgctggccgcgggctgccagcgggagagcatcgtggtcatggagagcagcttc 

15 caaatcacagaggagacccagattgacaccaccctgcggcgcagccagatgtccccccaaggcctgcgggtccgtctgcgg 
cccggtgaggagcggcattttgagctggaggtgtttgagccactggagagccccgtggacctgtacatcctcatggacttc 
tccaactccatgtccgatgatctggacaacctcaagaagatggggcagaacctggctcgggtcctgagccagctcaccagc 
gactacactattggatttggcaagtttgtggacaaagtcagcgtcccgcagacggacatgaggcctgagaagctgaaggag 
ccttggcccaacagtgacccccccttctccttcaagaacgtcatcagcctgacagaagatgtggatgagttccggaataaa 

20 ctgcagggagagcggatctcaggcaacctggatgctcctgagggcggcttcgatgccatcctgcagacagctgtgtgcacg 
agggacattggctggcgcccggacagcacccacctgctggtcttctccaccgagtcagccttccactatgaggctgatggc 
gccaacgtgctggctggcatcatgagccgcaacgatgaacggtgccacctggacaccacgggcacctacacccagtacagg 
acacaggactacccgtcggtgcccaccctggtgcgcctgctcgccaagcacaacatcatccccatctttgctgtcaccaac 
tactcctatagctactacgagaagcttcacacctatttccctgtctcctcactgggggtgctgcaggaggactcgtccaac 

25 atcgtggagctgctggaggaggccttcaatcggatccgctccaacctggacatccgggccctagacagcccccgaggcctt 
cggacagaggtcacctccaagatgttccagaagacgaggactgggtcctttcacatccggcggggggaagtgggtatatac 
caggtgcagctgcgggcccttgagcacgtggatgggacgcacgtgtgccagctgccggaggaccagaagggcaacatccat 
ctgaaaccttccttctccgacggcctcaagatggacgcgggcatcatctgtgatgtgtgcacctgcgagctgcaaaaagag 
gtgcggtcagctcgctgcagcttcaacggagacttcgtgtgcggacagtgtgtgtgcagcgagggctggagtggccagacc 

30 tgcaactgctccaccggctctctgagtgacattcagccctgcctgcgggagggcgaggacaagccgtgctccggccgtggg 
gagtgccagtgcgggcactgtgtgtgctacggcgaaggccgctacgagggtcagttctgcgagtatgacaacttccagtgt 
ccccgcacttccgggttcctctgcaatgaccgaggacgctgctccatgggccagtgtgtgtgtgagcctggttggacaggc 
ccaagctgtgacfcgtcccctcagcaatgccacctgcatcgacagcaatgggggcatctgtaatggacgtggccactgtgag 
tgtggccgctgccactgccaccagcagtcgctctacacggacaccatctgcgagatcaactactcggcgtccacccgggcc 

35 tctgcgaggacctacgctcctgcgtgcagtgccaggcgtggggcaccggcgagaagaaggggcgcacgtgtgaggaatgca 
acttcaaggtcaagatggtggacgagcttaagagaggcgaggaggtggtggtgcgctgctccttccgggacgaggatgacg 
actgcacctacagctacaccatggaaggtgacggcgcccctgggcccaacagcaccgtcctggtgcacaagaagaagggac 
tgccctccgggctccttcfcggtggctcatccccctgctcctcctcctcctgccgctcctggccctgctactgctgctatgc 
tggaagtactgtgcctgctgcaaggcctgcctggcacttctcccgtgctgcaaccgaggtcacatggtgggctttaaggaa 

40 gaccactacatgctgcgggagaacctgatggccfcctgaccacttggacacgcccatgctgcgcagcgggaacctcaagggc 
cgtgacgtggtccgctggaaggtcaccaacaacatgcagcggcctggctttgccactcatgccgccagcatcaaccccaca 
gage tggtgccctacgggctgtccttgcgcctggcccgcctttgcaccgagaacc tgc tgaagcctgacactcgggagtgc 
gcccagctgcgccaggaggtggaggagaacctgaacgaggtctacaggcagatctccggtgtacacaagctccagcagacc 
aagttccggcagcagcccaafcgccgggaaaaagcaagaccacaccattgtggacacagtgctgatggcgccccgctcggcc 

45 aagccggccctgctgaagcttacagagaagcaggtggaacagagggccttccacgacctcaaggtggcccccggctactac 
accctcactgcagaccaggacgcccggggcatggtggagttccaggagggcgtggagctggtggacgtacgggtgcccctc 
tttatccggcctgaggatgacgacgagaagcagctgctggtggaggccatcgacgtgcccgcaggcactgccaccctcggc 
cgccgcctggtaaacatcaccafccatcaaggagcaagccagagacgtggtgtcctttgagcagcctgagttctcggtcagc 
cgcggggaccaggtggcccgcafcccctgtcatccggcgtgtcctggacggcgggaagtcccaggtctcctaccgcacacag 

50 gatggcaccgcgcagggcaaccgggactacatccccgtggagggtgagctgctgttccagcctggggaggcctggaaagag 
ctgcaggtgaagctcctggagctgcaagaagttgactccctcctgcggggccgccaggtccgccgtttccacgtccagctc 
agcaaccctaagtttggggcccacctgggccagccccactccaccaccatcatcatcagggacccagatgaactggaccgg 
agcttcacgagfccagatgtfcgtcatcacagccaccccctcacggcgacctgggcgccccgcagaaccccaatgctaaggcc 
gctgggtccaggaagatccatttcaactggctgcccccttctggcaagccaatggggtacagggtaaagtactggattcag 

55 ggtgactccgaatccgaagcccacctgctcgacagcaaggtgccctcagtggagctcaccaacctgtacccgtattgcgac 
tatgagatgaaggtgtgcgcctacggggctcagggcgagggaccctacagctccctggtgtcctgccgcacccaccaggaa 
gtgcccagcgagccagggcgtctggccttcaatgtcgtctcctccacggtgacccagctgagctgggctgagccggctgag 
accaacggtgagatcacagcctacgaggtctgctatggcctggtcaacgatgacaaccgacctattgggcccatgaagaaa 
gtgctggttgacaaccctaagaaccggatgctgcttattgagaaccttcgggagtcccagccctaccgctacacggtgaag 

60 gcgcgcaacggggccggctgggggccfcgagcgggaggccatcatcaacctggccacccagcccaagaggcccatgtccatc 
cccatcatccctgacatccctatcgtggacgcccagagcggggaggactacgacagcttccttatgtacagcgatgacgtt 
ctacgctctccatcgggcagccagaggcccagcgtctccgatgacactggctgcggctggaagttcgagcccctgctgggg 
gaggagctggacctgcggcgcgtcacgtggcggctgcccccggagctcatcccgcgcctgtcggccagcagcgggcgctcc 
tccgacgccgaggcgccccacgggcccccggacgacggcggcgcgggcgggaagggcggcagcctgccccgcagtgcgaca 

65 cccgggccccccggagagcacctggtgaatggccggatggactttgccttcccgggcagcaccaactccctgcacaggatg 
accacgaccagtgctgctgcctatggcacccacctgagcccacacgtgccccaccgcgtgctaagcacatcctccaccctc 
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acacgggactacaacbcacbgacccgcbcagaacacbcacacbcgaccacacbgccgagggacbactccaccctcacctcc 
gbcbccbcccacggccbcccbcccatcbgggaacacgggaggagcaggcttccgcbgtcctgggcccbggggbcccggagb 
cgggctcagatgaaagggttccccccttccaggggcccacgagactctataatcctggctgggaggccagcagcgccctcc 
tggggcccagactctcgcctgactgcbggtgtgcccgacacgcccacccgcctggtgbtctcbgcccbggggcccacabcb 
5 cbcagagbgagcbggcaggagccgcggbgcgagcggccgcbgcagggctacagbgbggagtaccagctgcbgaacggcggb 
gagctgcabcggcbcaacabccccaacccbgcccagacctcggbggbggbggaagaccbccbgcccaaccacbccbacgbg 
bfcccgcgbgcgggcccagagccaggaaggcbggggccgagagcgbgagggbgtcabcaccabbgaatcccaggtgcacccg 
cagagcccactgtgtccccbgccaggctccgcctbcacttbgagcacbcccagtgccccaggcccgcbggtgbtcactgcc 
cbgagcccagacbcgcbgcagcbgagctgggagcggccacggaggcccaatggggatatcgbcggcbaccbggtgaccbgt 

10 gagacggcccaaggaggaggbccagccaccgcattccgggbggabggagacagccccgagagccggcbgaccgbgccgggc 
ctcagcgagaacgtgcccbacaagbbcaaggbgcaggccaggaccacbgagggcbbcgggccagagcgcgagggcabcabc 
accatagagbcccaggabggaggbccctbcccgcagcbgggcagccgbgccgggcbcbbccagcacccgcbgcaaagcgag 
tacagcagcatchccaccacccacaccagcgccaccgagcccttcctagtgggtccgaccctgggggcccagcacctggag 
gcaggcggcbcccbcacccggcabgbgacccaggagbbbgbgagccggacacbgaccaccagcggaacccbtagcacccac 

15 abggaccaacagbbcbbccaaacbbga 

17. SEP ID NO:17 Human beta4 jntegrin protein sequence Variant V at 34 to 
t 

1 magprpspwa rlllaalisv slsgtlnrck kaplkscbec vrvdkdcayc bdemfrdrrc 
61 ntqaellaag cqresiwme ssfqibeebq idbblrrsqm spqglrvrlr pgeerhfele 
20 121 vfeplespvd lyilmdfsns msddldnlkk mgqnlarvls qltsdytigf gkfvdkvsvp 
181 qbdmrpeklk epwpnsdppf sCknvislbe dvdefrnklq gerisgnlda peggfdailq 
241 bavctrdigw rpdsbhllvf sbesafhyea dganvlagim srnderchld tbgbybqyrb 
301 qdypsvpblv rllakhniip ifavtnysys yyeklhtyfp vsslgvlqed ssnivellee 

3 61 afnrirsnld iraldsprgl rtavbskmfq kbrtgsfhir rgevgiygvq lralehvdgb 
25 421 hvcqlpedqk gnihlkpsfs dglkmdagii cdvcfccelqk evraarcsfn gdfvcgqcvc 

4 81 segwsgqbcn cebgslsdiq pclregedkp csgrgecqcg hcvcygegxy egqfceydnf 
541 qcprbsgflc ndrgrcsmgq cvcepgwbgp scdcplsnab cidsnggicn grghcecgrc 
601 hchqqslybd biceinysas brasarbyap acsarrgapa rrrgarvrna bsrsrwwbsl 
661 rearrwwcaa psgbrmbbap batpwkvtap Igptalswcb rrrdcppgsf wwliplllll 

30 721 lpllalllll cwkycaccka clallpccnr ghmvgfkedh ymlrenlmas dhldbpmlrs 
7 81 gnlkgrdwr wkvtnnmqrp gfabhaasin pbelvpygls lrlarlcben llkpdbreca 
841 qlrqeveenl nevyrqisgv hklqqbkfrq qpnagkkqdh bivdtvlmap raakpallkl 
901 bekqveqraf hdlkvapgyy bltadqdarg mvefqegvel vdvrvplfir pedddekqll 
961 veaidvpagb ablgrrlvni biikeqardv vsfeqpef sv srgdqvarip virrvldggk. 

35 1021 sqvsyrbqdg baqgnrdyip vegellfqpg eawkelqvkl lelqevdsll rgrqvrrfhv 
1081 qlsnpkfgah lgqphsbtii irdpdeldrs ftsqmlasqp pphgdlgapq npnakaagsr 
1141 kihfnwlpps gkpmgyrvky wiqgdsesea hlldskvpsv elbnlypycd yemkvcayga 
12 01 qgegpysslv scrbhqevps epgrlafnw ssbvbqlswa epaebngeit ayevcyglvn 
1261 ddnrpigpmk kvlvdnpknr mllienlres qpyrytvkar ngagwgpere aiinlabqpk 

40 1321 rpmsipiipd ipivdaqsge dydsflmysd dvlrspsgeq rpsvsddbgc gwkfepllge 
1381 eldlrrvbwr lppeliprls assgrssdae aphgppddgg aggkggslpr sabpgppgeh 
1441 lvngrmdfaf pgstnslhrm bttsaaaygb hlsphvphrv lsbsstltrd ynslbrsehs 
1501 hsbtlprdys tlbsvsshgl ppiwehgrsr Iplswalgsr sraqmlcgfpp srgprdsiil 
1561 agrpaapswg pdsrltagvp dbptrlvfsa Igpbslrvsw qeprcerplq gysveyqlln 

45 1621 ggelhrlnip npaqtswve dllpnhsyvf rvraqsqegw greregvibi esqvhpqspl 
1681 cplpgsaftl stpsapgplv fbalspdslq lswerprrpn gdivgylvbc emaqgggpab 
1741 afrvdgdspe srlbvpglse nvpykfkvqa rtbegfgper egiibiesqd ggpfpqlgsr 
1801 aglfqhplqs eyssiabbht sabepflvgp blgaqhleag gslbrhvtqe fvsrtlttsg 
1861 blsbhmdqqf fqt 

50 18. SEP ID NO:18 Human integrin beta4 subunit cds Variant at 34 V to I: 

abg gca ggg cca cgc ccc age cca bgg gec agg cbg cbc cbg gca gec btg abc age gtc 
age cbc beb ggg acc bbg aac cgc bgc aag aag gec cca AbA aag age bgc acg gag bgb 
gtc cgt gtg gab aag gac tgc gec bac tgc aca gac gag abg bbc agg 

gaccggcgctgcaacacccaggcggagctgcbggccgcgggcbgccagcgggagagcabcgbggtcabggagagcagcbtc 
55 caaabcacagaggagacccagabbgacaccacccbgcggcgcagccagabgbccccccaaggccfcgcgggbccgbcbgcgg 
cccggbgaggagcggcabbbbgagccggaggbgbbbgagccacbggagagccccgbggacctgbacatccbcabggacbbc 
tccaacbccabgbccgatgatcbggacaaccbcaagaagatggggcagaaccbggctcgggbcctgagccagctcaccagc 
gacbacacbabbggabbbggcaagtttgbggacaaagtcagcgtcccgcagacggacacgaggccbgagaagcbgaaggag 
cctbggcccaacagbgacccccccttctccttcaagaacgtcatcagccbgacagaagabgbggabgagbbccggaabaaa 
60 ctgcagggagagcggabctcaggcaaccbggatgcbccbgagggcggcfcbcgabgccabccbgcagacagcbgbgbgcacg 
agggacattggcbggcgcccggacagcacccaccbgcbggbcbbcbccaccgagbcagccbbccacbabgaggcbgabggc 
gccaacgbgcbggcbggcabcabgagccgcaacgatgaacggbgccaccbggacaccacgggcaccbacacccagbacagg 
acacaggacbacccgbcggtgcccacccbggbgcgccbgctcgccaagcacaacabcatccccatcbbbgcbgbcaccaac 
tacbccbabagcbacbacgagaagcbtcacaccbabtbcccbgbcbccbcacbgggggbgcbgcaggaggacbcgtccaac 
65 atcgbggagcbgcbggaggaggccbbcaabcggabccgcbccaaccbggacatccgggcccbagacagcccccgaggcctt 
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cggacagaggtcacctccaagabgttccagaagacgaggacbgggtcctbbcacabccggcggggggaagbgggtatatac 
caggtgcagctgcgggcccttgagcacgtggatgggacgcacgbgtgccagctgccggaggaccagaagggcaacatccat 
cbgaaaccbtccbbctccgacggcctcaagatggacgcgggcatcabctgbgabgtgbgcacctgcgagctgcaaaaagag 
gtgcggtcagctcgctgcagcttcaacggagacttcgtgtgcggacagtgtgtgtgcagcgagggctggagtggccagacc 
5 tgcaactgctccaccggctctctgagbgacattcagccctgccbgcgggagggcgaggacaagccgtgctccggccgtggg 
gagtgccagtgcgggcactgtgtgtgctacggcgaaggccgcbacgagggbcagtbctgcgagtatgacaacttccagtgt 
ccccgcactbccgggbtccbctgcaabgaccgaggacgcbgcbccabgggccagbgbgbgtgbgagccbggbbggacaggc 
ccaagctgbgactgbcccctcagcaabgccaccbgcafccgacagcaatgggggcabctgbaabggacgbggccacbgbgag 
bgbggccgcbgccacbgccaccagcagbcgcbcbacacggacaccabcbgcgagabcaactacbcggcgbccacccgggcc 

10 tctgcgaggaccbacgctcctgcgtgcagcgccaggcgbggggcaccggcgagaagaaggggcgcacgtgtgaggaatgca 
acttcaaggtcaagatggtggacgagctbaagagaggcgaggaggbggtggtgcgctgcbcctbccgggacgaggatgacg 
actgcaccbacagctacaccabggaaggtgacggcgcccctgggcccaacagcactgtcctggtgcacaagaagaagggac 
bgccctccgggcbccbbcbggtggcbcatccccctgctccbccbccbccbgccgctccbggcccbgcbacbgctgcbatgc 
bggaagtacbgbgcctgcbgcaaggccbgccbggcacbbcbcccgbgcbgcaaccgaggtcacabggbgggcbbtaaggaa 

15 gaccacbacabgctgcgggagaaccbgabggccbcbgaccacbbggacacgcccabgcbgcgcagcgggaaccbcaagggc 
cgtgacgbggtccgctggaaggtcaccaacaacatgcagcggccbggcbttgccacbcatgccgccagcabcaaccccaca 
gagcbggbgcccbacgggcbgbccbbgcgccbggcccgccbbtgcaccgagaacctgctgaagccbgacactcgggagtgc 
gcccagctgcgccaggaggtggaggagaacctgaacgaggtcbacaggcagatcbccggtgtacacaagctccagcagacc 
aagttccggcagcagcccaatgccgggaaaaagcaagaccacaccattgbggacacagtgctgabggcgccccgctcggcc 

20 aagccggccctgctgaagcbtacagagaagcaggtggaacagagggccbbccacgaccbcaaggbggcccccggctactac 
acccfccactgcagaccaggacgcccggggcabggtggagbbccaggagggcgbggagcbggbggacgbacgggbgccccbc 
bbbabccggccbgaggabgacgacgagaagcagctgcbggtggaggccabcgacgtgcccgcaggcacbgccaccctcggc 
cgccgccbggbaaacabcaccabcabcaaggagcaagccagagacgbggbgbccbbtgagcagccbgagbbcbcggbcagc 
cgcggggaccaggtggcccgcabccctgtcatccggcgbgtcctggacggcgggaagtcccaggtctcctaccgcacacag 

25 gatggcaccgcgcagggcaaccgggacbacabccccgtggagggbgagcbgctgttccagcctggggaggcctggaaagag 
cbgcaggtgaagctcctggagctgcaagaagttgactccctcctgcggggccgccaggtccgccgtttccacgtccagctc 
agcaaccctaagtbbggggcccaccbgggccagccccacbccaccaccabcatcabcagggacccagabgaacbggaccgg 
agcbtcacgagbcagabgbbgbcabcacagccacccccbcacggcgaccbgggcgccccgcagaaccccaabgcbaaggcc 
gcbgggtccaggaagabccabbtcaacbggctgccccctbcbggcaagccaatggggbacagggtaaagbacbggattcag 

30 ggbgactccgaabccgaagcccaccbgctcgacagcaaggbgcccbcagbggagcfccaccaaccbgfcacccgtabtgcgac 
babgagabgaaggbgbgcgccbacggggcbcagggcgagggacccbacagcbcccbggbgbccbgccgcacccaccaggaa 
gtgcccagcgagccagggcgtctggccbtcaabgtcgtctcctccacggtgacccagctgagctgggctgagccggctgag 
accaacggtgagafccacagcctacgaggbcbgctatggccbggtcaacgabgacaaccgacctatbgggcccatgaagaaa 
gbgctggbtgacaacccbaagaaccggatgcfcgctbabtgagaaccbbcgggagbcccagccctaccgcbacacggbgaag 

35 gcgcgcaacggggccggcbgggggcctgagcgggaggccabcatcaaccbggccacccagcccaagaggcccatgtccabc 
cccatcabcccbgacabccctatcgbggacgcccagagcggggaggacbacgacagcbtccbbabgbacagcgabgacgtb 
cbacgcbctccabcgggcagccagaggcccagcgbcbccgabgacactggcbgcggcbggaagbbcgagccccbgcbgggg 
gaggagcbggacctgcggcgcgtcacgbggcggcbgcccccggagctcabcccgcgccbgbcggccagcagcgggcgctcc 
bccgacgccgaggcgccccacgggcccccggacgacggcggcgcgggcgggaagggcggcagccbgccccgcagtgcgaca 

40 cccgggccccccggagagcaccbggtgaatggccggatggactttgccttcccgggcagcaccaactccctgcacaggabg 
accacgaccagbgcbgctgcctatggcacccaccbgagcccacacgtgccccaccgcgtgctaagcacabccbccaccctc 
acacgggactacaacbcacbgacccgcbcagaacacbcacacbcgaccacactgccgagggacbacbccacccbcaccbcc 
gbcbcctcccacggccbccctcccabcbgggaacacgggaggagcaggctbccgctgbccbgggcccbggggbcccggagt 
cgggcbcagabgaaagggbbccccccbbccaggggcccacgagacbctabaabcctggcbgggaggccagcagcgccctcc 

45 bggggcccagacbctcgccbgactgcbggbgbgcccgacacgcccacccgccbggtgtbcbcbgcccbggggcccacabcb 
cbcagagtgagcbggcaggagccgcggbgcgagcggccgcbgcagggcbacagbgtggagbaccagcbgctgaacggcggb 
gagctgcabcggctcaacabccccaaccctgcccagaccbcggtggfcggbggaagacctcctgcccaaccacbcctacgtg 
tbccgcgbgcgggcccagagccaggaaggctggggccgagagcgbgagggbgtcatcaccattgaabcccaggbgcacccg 
cagagcccacbgbgbccccbgccaggcbccgccbbcacbbfcgagcacbcccagbgccccaggcccgcbggbgbbcacbgcc 

50 cbgagcccagacbcgcbgcagctgagctgggagcggccacggaggcccaabggggabatcgbcggcbaccbggbgaccbgb 
gagabggcccaaggaggaggtccagccaccgcatbccgggbggatggagacagccccgagagccggcbgaccgbgccgggc 
cbcagcgagaacgbgcccbacaagbtcaaggtgcaggccaggaccacbgagggcttcgggccagagcgcgagggcabcabc 
accatagagtcccaggatggaggtcccttcccgcagcbgggcagccgtgccgggctcbtccagcacccgctgcaaagcgag 
tacagcagcatctccaccacccacaccagcgccaccgagcccttccbagtgggtccgaccctgggggcccagcacctggag 

55 gcaggcggcbcccbcacccggcabgtgacccaggagbbtgbgagccggacacbgaccaccagcggaacccttagcacccac 
abggaccaacagfcbcbfcccaaacbbga 

19. SEP ID NO: 19 Degenerate Human integrin beta4 subunit cds variant at 
34 Vto L 

atg gca ggg cca cgG ccc age cca tgg gec agg ctg etc ctg gca gec ttg ate age gbc 
60 age etc bet ggg ace btg aac cgc tgc aag aag gee cca ktA aag age tgc acg gag tgt 
gtc cgfc gtg gab aag gac tgc gee bac bgc aca gac gag abg bbc agg 

gaccggcgcbgcaacacccaggcggagctgcbggccgcgggcbgccagcgggagagcabcgbggbcatggagagcagcbbc 
caaabcacagaggagacccagatbgacaccaccctgcggcgcagccagabgtccccccaaggcctgcgggtccgfcctgcgg 
cccggtgaggagcggcabtbtgagcbggaggbgttbgagccactggagagccccgtggacctgbacabcctcabggacbtc 
65 bccaacfcccabgbccgabgabcbggacaaccbcaagaagabggggcagaaccbggctcgggbccbgagccagctcaccagc 
gactacacbattggatbtggcaagtttgtggacaaagtcagcgtcccgcagacggacatgaggccbgagaagctgaaggag 
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cctfcggcccaacagtgacccccccttctccttcaagaacgtcatcagccfcgacagaagatgtggatgagttccggaataaa 
cbgcagggagagcggabcbcaggcaaccbggatgcbccbgagggcggcttcgabgccabccbgcagacagctgtgtgcacg 
agggacattggctggcgcccggacagcacccacctgctggfccttctccaccgagtcagccttccactatgaggctgatggc 
gccaacgtgctggctggcatcatgagccgcaacgatgaacggtgccacctggacaccacgggcacctacacccagtacagg 
5 acacaggactacccgtcggtgcccaccctggtgcgcctgctcgccaagcacaacatcatccccatctttgctgbcaccaac 
tactcctatagctactacgagaagcttcacacctatttccctgtcfccctcactgggggtgctgcaggaggactcgtccaac 
abcgbggagctgcfcggaggaggccttcaatcggatccgctccaacctggacatccgggccctagacagcccccgaggccbt 
cggacagaggbcaccbccaagabgbbccagaagacgaggacbgggbccbtbcacabccggcggggggaagbgggbababac 
caggtgcagcbgcgggcccbtgagcacgbggabgggacgcacgbgbgccagcbgccggaggaccagaagggcaacabccab 

10 cbgaaaccttccttctccgacggccbcaagatggacgcgggcabcabctgbgatgbgbgcaccbgcgagctgcaaaaagag 
gtgcggtcagcbcgcbgcagctbcaacggagactbcgbgtgcggacagbgbgtgbgcagcgagggctggagbggccagacc 
bgcaacbgcbccaccggcbctcbgagbgacabbcagcccbgccbgcgggagggcgaggacaagccgbgcbccggccgbggg 
gagbgccagbgcgggcacbgbgbgtgcbacggcgaaggccgcbacgagggbcagbbcbgcgagbabgacaacbtccagtgb 
ccccgcacbbccgggbfcccbcbgcaabgaccgaggacgcbgcbccabgggccagbgbgbgbgbgagccbggbtggacaggc 

15 ccaagcbgfcgacbgbccccbcagcaabgccacctgcabcgacagcaatgggggcabcbgtaabggacgbggccacbgbgag 
tgbggccgcbgccacbgccaccagcagtcgctcbacacggacaccabctgcgagabcaactacbcggcgbccacccgggcc 
bcbgcgaggacctacgctccfcgcgbgcagtgccaggcgbggggcaccggcgagaagaaggggcgcacgtgbgaggaabgca 
actbcaaggtcaagatggbggacgagcctaagagaggcgaggaggtggtggtgcgctgctccttccgggacgaggatgacg 
acbgcaccbacagctacaccatggaaggbgacggcgccccfcgggcccaacagcacbgbcctggbgcacaagaagaagggac 

20 bgcccbccgggcbccbtcbggtggcbcatcccccbgcbcctccbccbccbgccgcbccbggcccbgcbacbgcbgctatgc 
bggaagtacbgtgcctgctgcaaggcctgcctggcactbctcccgtgctgcaaccgaggtcacatggtgggcbttaaggaa 
gaccacbacabgcbgcgggagaaccbgabggcctcbgaccacbbggacacgcccabgcbgcgcagcgggaacctcaagggc 
cgbgacgbggbccgctggaaggbcaccaacaacabgcagcggccbggctbbgccacbcafcgccgccagcafccaaccccaca 
gagcbggbgccctacgggcbgtccbbgcgcctggcccgcctbbgcaccgagaaccbgcbgaagcctgacacfccgggagbgc 

25 gcccagctgcgccaggaggtggaggagaacctgaacgaggtctacaggcagatcbccggtgtacacaagctccagcagacc 
aagtbccggcagcagcccaatgccgggaaaaagcaagaccacaccabtgtggacacagbgcbgatggcgccccgcbcggcc 
aagccggcccbgctgaagcbbacagagaagcaggbggaacagagggccttccacgaccbcaaggbggcccccggcbacbac 
accctcacbgcagaccaggacgcccggggcabggbggagbtccaggagggcgbggagcbggtggacgbacgggbgccccbc 
ttbatccggcctgaggatgacgacgagaagcagcbgctggbggaggccatcgacgtgcccgcaggcactgccaccctcggc 

30 cgccgcctggbaaacabcaccabcabcaaggagcaagccagagacgbggtgbcctbtgagcagccbgagbbcbcggtcagc 
cgcggggaccaggbggcccgcabcccbgbcabccggcgbgtccbggacggcgggaagbcccaggbcbccbaccgcacacag 
gatggcaccgcgcagggcaaccgggactacatccccgtggagggtgagcbgctgttccagccbggggaggcctggaaagag 
cbgcaggbgaagctccbggagctgcaagaagbbgacbcccbccbgcggggccgccaggbccgccgbbbccacgtccagcbc 
agcaacccbaagbbtggggcccaccbgggccagccccacbccaccaccabcabcabcagggacccagatgaacbggaccgg 

35 agcbbcacgagbcagafcgbbgtcafccacagccacccccbcacggcgacctgggcgccccgcagaaccccaabgcbaaggcc 
gcbgggbccaggaagabccattbcaactggcbgcccccbbcbggcaagccaabggggbacagggbaaagbactggabbcag 
ggbgactccgaatccgaagcccacctgctcgacagcaaggtgccctcagtggagctcaccaacctgtacccgtattgcgac 
babgagabgaaggbgbgcgcctacggggcbcagggcgagggacccbacagctccctggbgbccbgccgcacccaccaggaa 
gtgcccagcgagccagggcgtctggcctbcaabgbcgtcbcctccacggtgacccagcbgagcbgggcbgagccggcbgag 

40 accaacggtgagatcacagcctacgaggbctgcbabggcctggbcaacgatgacaaccgaccbabtgggcccatgaagaaa 
gbgctggtbgacaaccctaagaaccggabgctgcbbabbgagaaccttcgggagtcccagccctaccgctacacggbgaag 
gcgcgcaacggggccggcbgggggccbgagcgggaggccabcabcaacctggccacccagcccaagaggcccabgbccabc 
cccabcabcccbgacabcccbabcgbggacgcccagagcggggaggacbacgacagcbbcctbabgtacagcgabgacgbb 
cbacgcfccbccatcgggcagccagaggcccagcgbctccgabgacacbggcbgcggcbggaagbtcgagccccbgcbgggg 

45 gaggagcbggacctgcggcgcgbcacgbggcggcbgcccccggagcbcatcccgcgccbgtcggccagcagcgggcgcbcc 
bccgacgccgaggcgccccacgggcccccggacgacggcggcgcgggcgggaagggcggcagccbgccccgcagbgcgaca 
cccgggccccccggagagcacctggbgaatggccggabggactbbgccttcccgggcagcaccaactcccbgcacaggatg 
accacgaccagbgcbgctgcctabggcacccaccbgagcccacacgbgccccaccgcgbgctaagcacabccbccaccctc 
acacgggactacaacbcactgacccgctcagaacactcacactcgaccacacbgccgagggactacbccaccctcacctcc 

50 gbcbccbcccacggccbcccbcccabcbgggaacacgggaggagcaggctbccgcbgbcctgggcccbggggbcccggagt 
cgggcbcagabgaaagggtbccccccttccaggggcccacgagactcbabaabccbggcbgggaggccagcagcgccctcc 
bggggcccagactcbcgccbgacbgcbggtgbgcccgacacgcccacccgccbggbgtbctctgcccbggggcccacabcb 
cbcagagtgagctggcaggagccgcggbgcgagcggccgcbgcagggcbacagtgbggagtaccagcbgcbgaacggcggb 
gagcbgcafccggctcaacabccccaacccbgcccagaccbcggbggbggbggaagaccbccfcgcccaaccacbcctacgbg 

55 bbccgcgbgcgggcccagagccaggaaggcbggggccgagagcgbgagggbgbcabcaccatbgaatcccaggbgcacccg 
cagagcccacbgfcgfcccccbgccaggcfcccgccbbcacbbtgagcacfccccagbgccccaggcccgctggtgbfccacbgcc 
ctgagcccagactcgcbgcagcbgagcbgggagcggccacggaggcccaabggggababcgtcggcbacctggtgaccbgb 
gagabggcccaaggaggaggbccagccaccgcabbccgggbggafcggagacagccccgagagccggcbgaccgbgccgggc 
cbcagcgagaacgbgcccbacaagbbcaaggtgcaggccaggaccactgagggcbbcgggccagagcgcgagggcabcatc 

60 accatagagtcccaggatggaggtcccttcccgcagctgggcagccgtgccgggctcbtccagcacccgctgcaaagcgag 
tacagcagcabctccaccacccacaccagcgccaccgagcccttcctagtgggtccgaccctgggggcccagcacctggag 
gcaggcggcbcccbcacccggcabgbgacccaggagbbbgbgagccggacacbgaccaccagcggaacccbbagcacccac 
abggaccaacagtbcbbccaaacfcbga 
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What is claimed is: 

1. A method of reducing the proliferation of a cancer cell which comprises inhibiting 
ligand binding to an integrin receptor on the cancer cell, wherein the integrin receptor comprises an 
integrin. 

2. The method of claim 1, wherein the integrin receptor comprises integrin B4. 

3. The method of claim I. wherein the integrin receptor comprises integrin A6. 

4. The method of claim I, wherein the ligand that binds to the integrin receptor is 
laminin5. 

5. The method of claim I, wherein the integrin receptor is A6B4. 

6. The method of claim L wherein the ligand comprises lamininS. 

7. The method of claim 1, wherein the ligand comprises the gamma-2 subunit of laminin5. 

8. The method of claim I, wherein inhibiting ligand binding to an integrin receptor 
comprises contacting a A6 integrin with a composition that inhibits ligand binding. 

9. The method of claim I, wherein inhibiting ligand binding to an integrin receptor 
comprises contacting a B4 integrin with a composition that inhibits ligand binding. 

10. The method of claim I, wherein inhibiting ligand binding to an integrin receptor 
comprises contacting a laminin5 with a composition that inhibits ligand binding. 

11. The method of claim 1, wherein inhibiting ligand binding to an integrin receptor 
comprises contacting a gamma-2 subunit with a composition that inhibits ligand binding. 

12. A method of reducing the proliferation of a cancer cell which comprises reducing 
integrin-integrin interaction, integrin receptor clustering interaction, or integrin-non-integrin protein 
interaction. 

13. The method of claim 12, wherein reducing integrin-integrin interaction, integrin 
receptor clustering interaction, or integrin-non-integrin protein interaction comprises contacting a 
A6 integrin with a composition that inhibits an interaction between the B4 integrin and another 
integrin or protein molecule. 

14. The method of claim 12, wherein reducing integrin-integrin interaction, integrin 
receptor clustering interaction, or integrin-non-integrin protein interaction comprises contacting a 
B4 integrin with a composition that inhibits the interaction between the B4 integrin and another 
integrin or protein molecule.. 

15. The method of claim 12, wherein reducing integrin-integrin interaction, integrin 

receptor clustering interaction, or integrin-non-integrin protein interaction comprises contacting a 
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Iarninin5 wim a composition that inhibits ligand binding. 

16. The method of claim 12, wherein reducing integrin-integrin interaction, integrin 
receptor clustering interaction, or integrin- non-integrin protein interaction comprises contacting a 
gamma2 subunit with a composition that inhibits ligand binding. 

17. The method of claim 12, wherein the non-integrin protein comprises a growth factor 
receptor. 

18. The method of claim 12, wherein the non-integrin protein comprises a hemi- 
desomosome junction. 

19. The method of claim 12, wherein the non-integrin protein comprises a SH2 domain. 

20. The method of claim 19, wherein the non-integrin protein comprises a She protein. 

21. The method of claim 12, wherein the non-integrin protein comprises a IRS-1 protein. 

22. The method of claim 12, wherein the non-integrin protein comprises a IRS-2 protein. 

23. A method of reducing the proliferation of a cancer cell which comprises reducing the 
production of an integrin by the cancer cell. 

24. The method of claim 23, wherein the production of an integrin is reduced by inhibiting 
signaling leading to induction of expression of an integrin. 

25. The method of claim 23, wherein reducing the production of an integrin comprises 
inhibiting alpha6 production. 

26. The method of claim 25, wherein inhibiting alpha6 production further comprises using 
antisense molecules to alphajS mRNA. 

27. The method of claim 23, wherein reducing the production of an integrin comprises 
inhibiting beta4 production. 

28. The method of claim 27, wherein inhibiting beta4 production further comprises using 
antisense molecules to beta4 mRNA. 

29. A method of reducing the proliferation of a cancer cell which comprises reducing the 
production of an integrin receptor ligand by the cancer cell. 

30. The method of claim 29, wherein reducing the production of an integrin receptor 
ligand comprises inhibiting gamma2 production. 

31. The method of claim 29, wherein reducing the production of an integrin receptor 
ligand comprises inhibiting laminin production. 

32. The method of claim 29, wherein reducing the production of an integrin receptor 
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ligand comprises inhibiting Iaminin5 production. 

33. A method of reducing the proliferation of a cancer cell which comprises interfering 
with an integrin signaling pathway. 

34. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting an A6 integrin in the cell with a composition that inhibits ligand binding. 

35. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting a B4 integrin with a composition that inhibits ligand binding. 

36. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting a larninin5 with a composition that inhibits ligand binding. 

37. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting a gamma2 subunit with a composition that inhibits ligand binding. 

38. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting the cancer cell with a molecule that interferes with at least one of talin, 
paxillin, vinculin, a CAS family protein, CRX, NCK, FAK, ILK, Src, Fyn, She, Grb-2, Guaning 
nucleotide exchange factors, SOS, DOCK I80,.Vav, Syk, P-l-3 kinase, AKT, Bad, Bid, Caspase 9, 
Cdc42, PAK, Rac, Rho, Rho kinase, Ras, Caveolin, Tetraspan, Receptor-type protein tyrosine 
phosphatase, SHP-2, Alpha-actinin, Filamin. Cytohesin, Beta3-endonexin. ICAP-I, RACK-1, CIB, 
actin, receptor tyrosine kinase, ERS-1 or IRS-2. 

39. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting the cancer cell with an agent that interferes with post-translational 
modification of integrins. 

40. The method of claim 39, wherein the post translational modification is 
glycosylation or phosphorylation. 

41. The method of claims i, 12, 23, 29, 33, wherein the integrin comprises an A6 integrin. 

42. The method of claims 1, 12, 23, 29, 33, wherein the integrin comprises a B4 integrin. 

43. The method of claims 1, 12, 23, 29, 33, further comprising reducing a lamininS- 
integrin interaction. 

44. The method of claims 1, 12, 23, 29, 33, further comprising reducing a laminin5 
gamma2 integrin interaction. 

45. The method of claims 1, 12, 23, 29, 33, wherein reducing the proliferation of the 
cancer cells is selective. 

46. The method of claims i, 12, 23, 29, 33, wherein the cancer cell is not an MDA-MB- 
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435 cell. 

47. The method of claims 1, 12, 23, 29, 33. wherein the cancer cell is not an HMT-3522 

ceil. 

48. The method of claims 1, 12, 23, 29. 33, wherein the cancer cell is not an RKO colon 
carcinoma line. 

49. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell does not express 
exogenous B4 integrin. 

50. The method of claims U 12, 23, 29, 33, further comprising contacting the cancer cells 
with a small molecule, peptide, peptide mimetic, or oligonucleotide or synthetic analog thereof. 

5 1 . The method of claim 50, wherein the cancer ceils are contacted with dominant- 
negative beta 4 integrin. 

52. The method of claim 50, wherein the cancer cell is contacted with an antisense 
molecule. 

53. The method of claim 52, wherein the antisense molecule is linked to a leader 
sequence which enables translocation across a cell membrane. 

54. The method of claim 53, wherein the leader sequence binds to a cell surface protein 
which facilitates internalization. 

55. The method of claim 54, wherein the leader sequence is TAT or antennapedia, or 
fragment thereof. 

56. The method of claim 52, wherein the antisense molecule is an alpha6 RNA 
antisense molecule. 

57. The method of claim 47, wherein the small molecule peptide, peptide mimetic, or 
oligonucleotide or synthetic analog thereof is linked to a carrier. 

58. The method of claim 57, wherein the carrier is at least one of a Iipidic carrier, 
charged carrier, retroviral carrier, TAT or fragment thereof, antennapedia or fragment thereof, or 
polyethylene glycol. 

59. The method of any one of claims 1, 12, 23, 29, 33, further comprising contacting 
the cancer cell with another agent which modulates cell signaling, a chemotherapeutic drug, or 
treated with radiation or angiogenesis inhibitor. 

60. The method of any of claims 1,12, 23, 29, 33, wherein reducing the proliferation 
of cancer cell is in vitro. 

61. The method of any of claims 1, 12, 23, 29, 33, wherein reducing the proliferation 
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of the cancer cell is in vivo, 

62. The method of any of claims 1.12, 23, 29. 33, wherein the cancer cell is selected 
from the group consisting of melanoma, adenoma, lymphoma, myeloma, carcinoma, 
plasmocytoma, sarcoma, glioma, thyoma, leukemia, skin cancer, retinal cancer, breast cancer, 
prostate cancer, colon cancer, esophageal cancer, stomac cancer, pancreas cancer, brain tumors, 
lung cancer, ovarian cancer, cervical cancer, hepatic cancer, gastrointestinal cancer, and head and 
neck cancer cells. 

63. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell is killed. 

64. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated 

Ras. 

65. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated 
Ras and a mutated p53. 

66. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated 
Ras and activates the AKT/PKB protein. 

67. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated 
Ras, a mutated p53, and activates the AKT/PKB protein. 

68. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated 

APC. 

69. The method of claims 1,12, 23, 29, 33, wherein the cancer cell expresses a mutated 
Ras and mutated APC. 

70. A method of isolating molecules that bind with a target molecule selected from the 
group consisting, B4 integrin, alpha6 integrin, and the gamma2 subunit of laminin5 comprising 1) 
contacting a library of molecules with the target molecule and 2) collecting molecules that bind the 
target molecule producing an enriched population of molecules. 

71 . The method of claim 70, further comprising the step of repeating steps 1 and 2 with 
the enriched population of molecules. 

72. The method of claim 70, wherein the library comprises a small molecule, peptide, 
peptide mimetic, or oligonucleotide. 

73. A method of assessing a subject's risk of developing cancer comprising determining 
the amount of A6 present in a target cell obtained from the subject, wherein a determination of 
increased levels of A6 correlates with an increased risk of cancer. 

74. The method of claim 73, further comprising comparing the amount A6 present to the 
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amount in axontrol cell. 

75. The method of claim 73, wherein determining the amount of A6 present in the target 
cell comprises assaying the amount of A6 rnRNA in the cell. 

76. The method of claim 75, wherein the assaying the amount of mRNA in the cell 
comprises hybridizing a A6 probe to a sample of the subject's mRNA. 

77. The method claim 75, wherein the assaying the amount of mRNA in the cell comprises 
hybridizing a A6 primer to a sample of the subject's mRNA. 

78. The method claim 77, wherein the assaying the amount of mRNA further comprises 
performing an nucleic acid amplification reaction involving the primer. 

79. The method of claim 78, wherein the nucleic acid amplification reaction comprises 
reverse transcription, producing a cDNA. 

80. The method of claim 79, wherein the nucleic acid amplification reaction further 
comprises performing a polymerase chain reaction on the cDNA. 

81. The method of claim 73, wherein determining the amount of A6 present in the target 
cell comprises assaying the amount of A6 protein in the cell. 

82. A method of assessing a subject's risk of acquiring cancer comprising determining the 
amount of B4 present in a target cell obtained from the subject, wherein a determination of 
increased levels of B4 correlates with an increased risk of cancer. 

83. The method of claim 82, further comprising comparing the amount B4 present to the 
amount in a control cell. 

84. A method of assessing a subject's risk of acquiring cancer comprising determining the 
amount of laminin5 present in a target cell obtained from the subject wherein a determination of 
increased levels of lamininS correlates with an increased risk of cancer. 

85. The method of claim 84, further comprising comparing the amount lamininS present to 
the amount in a control cell. 

86. A method of assessing a subject's risk of acquiring cancer comprising determining the 
amount of gamma2 subunit present in a target cell obtained from the subject, wherein a 
determination of increased levels of gamma2 subunit correlates with an increased risk of cancer. 

87. The method of claim 86, further comprising comparing the amount gamma2 subunit 
present to the amount in a control cell. 

88. A method of reducing the proliferation of a cancer cell in a patient which comprises 
administering to the patient a composition which inhibits ligand binding to an integrin receptor on 
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the cancer cell, wherein the integrin receptor comprises an integrin. 

89. A method of reducing the proliferation of a cancer cell in a patient which comprises 
administering to the patient a composition which reduces integrin- integrin interaction, integrin 
receptor clustering interaction, or integrin-non-integrin protein interaction, 

90. A method of reducing the proliferation of a cancer cell in a patient which comprises 
administering to the patient a composition which reduces the production of an integrin or laminin 
by the cancer cell. 

91. A method of reducing the proliferation of a cancer cell in a patient which comprises 
administering to the patient a composition which interfers with an integrin signaling pathway. 

92. The methods of claims 88-91, wherein the reduction in cancer cell proliferation is 
selective. 

93. The method of any of claims 88-91, wherein the administering is local or systemic. 

94. The method of any of claims 88-91, wherein the patient is additionally 
administered an agent which modulates cell signaling, a chemotherapeutic drug, or treated with 
radiation or angiogenesis inhibitor. 

95. The method of claim 94, wherein the additional agent is administered serially or in 
combination. 

96. The method of claim 93, wherein the local administering is direct application to 
cancer cells. 

97. The method of claim 96, wherein the direct application to cancer cells is performed 
during surgery. 

98. The method of claim 97, wherein the direct application to cancer cells is performed 
topically to cancerous tissue. 

99. The method of claim 94, wherein the systemic administration is by subcutaneous, 
intraperitoneal, intra-arterial, intravenous, or bolus administration, or by application through a 
catheter or similar apparatus. 

100. The method of claim 94, wherein the systemic administration comprises a long- 
term release formulation. 

101. The method of claim 94, wherein the systemic administration is by oral 
administration. 

102. The method of claim 101 wherein the oral administration comprises administering 
a pill, capsule, tablet, liquid or suspension. 
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103. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which inhibits ligand binding to integrin receptors on cancer cells wherein the coding sequence is 
under the control of a promoter which functions in mammalian cells. 

104. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which prevents integrin receptor subunits from interacting with one another, prevents integrin 
receptor clustering interaction, or prevents integrin receptor subunits from interacting with other 
proteins on cancer cells wherein the coding sequence is under the control of a promoter which 
functions in mammalian cells. 

105. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which prevents integrin receptor subunits from interacting with one another, prevents integrin 
receptor clustering interaction, or prevents integrin receptor subunits from interacting with other 
proteins in cancer cells wherein the coding sequence is under the control of a promoter which 
functions in mammalian cells. 

106. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which interferes with integrin subunit or laminin production wherein the coding sequence is under 
the control of a promoter which functions in mammalian cells. 

107. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which interferes with an integrin signaling pathway of cells wherein the coding sequence is under 
the control of a promoter which functions in mammalian cells. 

108. The methods of claims 103-107, wherein the reduction in cancer cell proliferation is 
selective. 

109. The method of claims 103-107 wherein the vector is administered directly to a 
cancer cell. 

1 10. The method of claims 103-107, wherein the vector is administered directly to a 
normal cell. 

1 1 1. The method of claims 103-107, wherein the vector is packaged in a viral vector or 
liposome. 

1 12. The method of claim 88 wherein the vector is a retroviral vector. 
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113. 

systemically. 

114. 
application. 

115. 
application. 

116. 



The method of any one of claims 103-107, wherein the vector is administered 
The method of claim 109, wherein the direct administration is by topical 
The method of claim 111, wherein the direct administration is by topical 
The method of claim 1 14, wherein the direct administration is performed during 



surgery. 

1 17. The method of claim 1 15, wherein the direct administration is performed during 

surgery. 

118. The method of any of claims 103-107, wherein the patient is an animal. 

119. The method of claim 118 wherein the animal is a mammal. 

120. The method of claim 1 19, wherein the mammal is human. 

121. The method of claims 103-107, wherein the cancer cells are selected from the 
group consisting of melanoma, adenoma, lymphoma, myeloma, carcinoma, plasmocytoma, 
sarcoma, glioma, thyoma, leukemia, skin cancer, retinal cancer, breast cancer, prostate cancer, 
colon cancer, esophageal cancer, stomac cancer, pancreas cancer, brain tumors, lung cancer, 
ovarian cancer, cervical cancer, hepatic cancer, gastrointestinal cancer, and head and neck cancer 
cells. 

122. The method of any of claims 103-107 wherein the patient is additionally 
administered at least one of another agent which modulates cell signaling, a chemotherapeutic drug, 
an angiogenesis inhibitor or treated with radiation. 

123. The method of claim 120 wherein the other agent which modifies cell signaling, 
chemotherapeutic drug, angiogenesis inhibitor or radiation treatment is administered serially or in 
combination. 
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SEQUENCE LISTING 

<110> University of Rochester Medical Center 

Deleu, Laurent 
Land, Hartmut 

<120> COMPOSITIONS THAT INHIBIT PROLIFERATION 
OF CANCER CELLS 

<130> 21108. 0005P1 

<150> 60/239,705 
<151> 2000-10-12 

<150> 60/242,812 
<151> 2000-10-24 

<160> 19 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 3222 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note =» 
synthetic construct 

<400> 1 

atggccgccg ccgggcagct gtgcttgctc tacctgtcgg cggggctcct gtcccggctc 60 

ggcgcagcct tcaacttgga cactcgggag gacaacgtga tccggaaata tggagacccc 120 

gggagcctct tcggcttctc gctggccatg cactggcaac tgcagcccga ggacaagcgg 180 

ctgttgctcg tgggggcccc gcgcggagaa gcgcttccac tgcagagagc caacagaacg 240 

ggagggctgt acagctgcga catcaccgcc cgggggccat gcacgcggat cgagtttgat 300 

aacgatgctg accccacgtc agaaagcaag gaagatcagt ggatgggggt caccgtccag 360 

agccaaggtc cagggggcaa ggtcgtgaca tgtgctcacc gatatgaaaa aaggcagcat 4 20 

gttaatacga agcaggaatc ccgagacatc tttgggcggt gttatgtcct gagtcagaat 480 

ctcaggattg aagacgatat ggatggggga gattggagct tttgtgatgg gcgattgaga 540 

ggccatgaga aatttggctc ttgccagcaa ggtgtagcag ctacttttac taaagacttt 600 

cattacattg tatttggagc cccgggtact tataactgga aagggattgt tcgtgtagag 660 

caaaagaata acactttttt tgacatgaac atctttgaag atgggcctta tgaagttggt 720 

ggagagactg agcatgatga aagtctcgtt cctgttcctg ctaacagtta cttaggtttt 780 

tctttggact cagggaaagg tattgtttct aaagatgaga tcacttttgt atctggtgct 840 

cccagagcca atcacagtgg agccgtggtt ttgctgaaga gagacatgaa gtctgcacat 900 

ctcctccctg agcacatatt cgatggagaa ggtctggcct cttcatttgg ctatgatgtg 960 

gcggtggtgg acctcaacaa ggatgggtgg caagatatag ttattggagc cccacagtat 1020 

fcttgafcagag atggagaagt tggaggtgca gtgtatgtct acatgaacca gcaaggcaga 1080 

tggaataatg tgaagccaat tcgtcttaat ggaaccaaag attctatgtt tggcattgca 1140 

gtaaaaaata ttggagatat taatcaagat ggctacccag atattgcagt tggagctccg 1200 

fcatgatgact tgggaaaggt ttttatctat catggatctg caaatggaat aaataccaaa 1260 

ccaacacagg ttctcaaggg tatatcacct tattttggat attcaattgc tggaaacatg 1320 

gaccttgatc gaaattccta ccctgatgtt gctgttggtt ccctctcaga ttcagtaact 1380 

attttcagat cccggcctgt gattaatatt cagaaaacca tcacagtaac tcctaacaga 1440 

attgacctcc gccagaaaac agcgtgtggg gcgcctagtg ggatatgcct ccaggttaaa 1500 

tcctgttttg aatatactgc taaccccgct ggttataatc cttcaatatc aattgtgggc 1560 

acacttgaag ctgaaaaaga aagaagaaaa tctgggctat cctcaagagt tcagtttcga 1620 

aaccaaggtt ctgagcccaa atatactcaa gaactaactc tgaagaggca gaaacagaaa 1680 

gtgtgcatgg aggaaaccct gtggctacag gataatatca gagataaact gcgtcccatt 1740 
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cccataactg 
ccagaagttc 
ttcttaaaag 
aaattttgca 
gtaccagaac 
agccctrtcca 
abtgcaacgt 
gagaaacagt 
aatccthtta 
acctt tgaca 
aatttggctc 
ggagttgcta 
aaatctgaag 
aaacctctta 
aatgggaaat 
tgtgagccac 
aaacgggaaa 
agaaaatacc 
ctgcgggggc 
tttctagagg 
gtgactgctg 
gtgtttccct 
gctattctcg 
ttcttcaaga 
gctcagccat 



cctcagtgga 
ttccaattct 
agggatgtgg 
cccgagaagg 
tagttctaaa 
acccaaggaa 
ttccagacac 
tgagttgtgt 
aaagaaattc 
ccccatatct 
caattacagc 
aaccttccca 
atgaagtggg 
caaacctcgg 
ggttgcttta 
aaaaggagafc 
ttactgaaaa 
agactcttaa 
tggacagcaa 
aatattccaa 
ctgccgaaaa 
caaagactgt 
ctgggatctt 
gaaataagaa 
ctgataaaga 



gatccaagag 
gaattcagat 
agacgacaat 
aaatcaagac 
agatcagaag 
tcccacaaaa 
tttaacctafc 
tgccaaccag 
aaatgtcact 
ggatattaat 
taaagcaaaa 
ggtgtatttt 
aagtttaata 
cacagcaacc 
tttggtgaaa 
aaactccctg 
acagatagat 
ctgtagcgtg 
ggcgtctctt 
actgaactac 
tatcaggctg 
agctcagtat 
gatgcttgct 
agatcattat 
gaggcttact 



ccaagctctc 
gaacccaaga 
gtatgtaaca 
aaattttctt 
gatattgctt 
gatggcgatg 
tctgcafcata 
aatggctcgc 
ttttatttgg 
ctgaagttag 
gtggttattg 
ggaggtacag 
gagtatgaat 
ttgaacattc 
gtagaatcca 
aacctaacgg 
gataacagaa 
aacgtgaact 
attttgcgct 
ttggacattc 
ccaaatgcag 
tcgggagtac 
ttattagtgt 
gatgccacat 
tctgatgcat 



gtaggcgagt 
cagctcatat 
gcaaccttaa 
atttaccaat 
tagaaataac 
acgcccatga 
gagaactgag 
aagctgactg 
ttttaagtac 
aaacaacaag 
aactgctttt 
ttgttggcga 
tcagggtaat 
agtggccaaa 
aaggattgga 
agtctcacaa 
aattttcttt 
gtgtgaacat 
cgaggttatg 
tcatgcgagc 
gcactcaggt 
cttggtggat 
ttatactatg 
atcacaaggc 
ag 



<210> 2 
<211> 1072 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note 
synthetic construct 



gaattcactt 
tgatgttcac 
actagaatat 
tcaaaaaggt 
agtgacaaac 
ggctaaactg 
ggctttccct 
tgagctcgga 
aactgaagtc 
caatcaagat 
atcggtctcg 
gcaagctatg 
aaacttaggt 
agaaattagc 
aaaggtaact 
ctcaagaaag 
atttgctgaa 
cagatgcccg 
gaacagcaca 
cttcattgat 
tcgagtgact 
catcctagtg 
gaagtgtggt 
tgagatccat 



1800 
18 60 
1920 
1980 
2040 
2100 
2160 
2220 
2230 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3222 



<400> 2 






















Met Ala 


Ala 


Ala 


Gly 


Gin 


Leu 


Cys 


Leu 


Leu 


Tyr 


Leu Ser Ala Gly Leu 


1 






5 










10 


15 


Leu Ser 


Arg 


Leu 
20 


Gly 


Ala 


Ala 


Phe 


Asn 
25 


Leu 


Asp 


Thr Arg Glu Asp Asn 
30 


Val lie 


Arg 
35 


Lys 


Tyr 


Gly 


Asp 


Pro 
40 


Gly 


Ser 


Leu 


Phe Gly Phe Ser Leu 
4 5 


Ala Met 


His 


Trp 


Gin 


Leu 


Gin 


Pro 


Glu 


Asp 


Lys 


Arg Leu Leu Val Gly 


50 










55 










60 


Ala Pro 


Arg 


Gly 


Glu 


Ala 


Leu 


Pro 


Leu 


Gin 


Arg 


Ala Asn Arg Thr Gly 


65 








70 










75 


80 


Gly Leu 


Tyr 


Ser 


Cys 
85 


Asp 


lie 


Thr 


Ala 


Arg 
90 


Gly 


Pro Cys Thr Arg He 
95 


Glu Phe 


Asp 


Asn 
100 


Asp 


Ala 


Asp 


Pro 


Thr 
105 


Ser 


Glu 


Ser Lys Glu Asp Gin 
110 


Trp Met 


Gly 


Val 


Thr 


Val 


Gin 


Ser 


Gin 


Gly 


Pro 


Glv Gly Lys Val Val 




115 










120 






125 


Thr Cys 


Ala 


His 


Arg 


Tyr 


Glu 


Lys 


Arg 


Gin 


His 


Val Asn Thr Lys Gin 


130 










135 










140 


Glu Ser 


Arg 


Asp 


He 


Phe 


Gly 


Arg 


Cys 


Tyr 


Val 


Leu Ser Gin Asn Leu 


145 








150 










155 


160 


Arg lie 


Glu 


Asp 


Asp 


Met 


Asp 


Gly 


Gly 


Asp 


Trp 


Ser Phe Cys Asp Gly 








165 










170 


175 


Arg Leu 


Arg 


Gly 


His 


Glu 


Lys 


Phe 


Gly 


Ser 


Cys 


Gin Gin Gly Val Ala 






180 










185 




190 


Ala Thr 


Phe 
195 


Thr 


Lys 


Asp 


Phe 


His 
200 


Tyr 


He 


Val 


phe Gly Ala Pro Gly 
205 
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Thr Tyr Asn 
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Lys 
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v ai 
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Asp 
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44 J 










230 








1 n Jl 
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Z\ « r\ 


k?XU 
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Leu 
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24 5 














cut? 








C p. - 
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Lys 








260 












lie 


Thr- 


rne 


Va 1 


oer 


bxy 


Aia 


D "A A, 

fro 


Arg 






0 1 c 

1 I 5 










? an 

t. 0 U 


Val 


Leu* 


Leu 


Lys 


Arg 


Asp 


Ma -f- 


T t * A 

Lys 


C A M 

Ser 




290 
















He 


Phe 


Asp 


biy 


blU 


Gly 


T A 1 * 

Leu 


Ala 


Ser 


305 










Tin 








Val 


Val 


Asp 


Leu 


7\ a a 

Asn 


T < • a 

Lys 


Asp 


Gly 


Trp 




















Pro 


Gin 


Tyr 


fne 


ASp 


Arg 


Asp 


Gly 


Glu 








?d n 










04 0 


Tyr Met 


TV a a 

Asn 


bin 


bin 


bly 


Arg 


Trp 


Asn 






o c c 

355 










3 bU 




Asn 


Gly 


Tnr 


x»ys 


Asp 


O A -A 

ser 


Met 


O A A 

fne 


Gly 




370 










J 1 0 






Asp 


He 


Asn 


Gin 


Asp 


Gly 


Tyr 


Pro 


Asp 


385 










3 yu 






Asp Asp 


Leu 


Gly 


T • » a 

Lys 


Val 


Pne. 


He 


Tyr 










405 










Asn 


Thr 


x>ys 


Pro 


Tnr 


Gin 


Val 


Leu 


Lys 








4 










3 D 


Tyr 


Ser 


lie 


Aia 


bly 


Asn 


Met 


7\ A A 

ASp 


Leu 






4 35 










4 4 U 




Val 


Ala 


val 


biy 


oer 


Leu 


Ser 


ASp 


O A — 

Ser 




450 










455 




Pro 


val 


lie 


Asn 


ne 


bin 


T • > A 

Lys 


mi,- 

xnr 


X 1 A 

He 


465 










4 f u 






Asp 


Leu 


Arg 


bin 


jjys 


Tnr , 


A.la 


Cys 


Gly 










485 










Gin 


Val 


Lys 


Ser 


Cys 


Pne 


Glu 


Tyr 


Thr 








a nn 
OUU 










505 


Pro 


Ser 


He 


Ser 


T1 a 

lie 


Val 


Gly 


Thr 


Leu 






515 








r o r\ 

520 




Lys 


Ser 


Gly 


T A«« 

Leu 


O A -A 

ser 


Ser 


Arg 


val 


Gin 




530 
















Pro 


Lys 


Tyr 


lnr 


bin 


blU 


T A ■ ■« 

Leu 


Tnr 


Leu 


545 










OOU 








Cys 


Met 


Glu 


GlU 


xnr 


Leu 


Trp 


T A . « 

Leu 


Gin 


















Arg 


Pro 


lie 


Ova 

Pro 


He 


Tnr 


Ala 


Ser 


Val 








OO U 










585 


Arg 


Arg 


Arg 


val 


Asn 


Ser 


T A.« 

Leu 


Pro 


Glu 






c nc 

595 










OUU 




Asp 


Glu 


Pro 


Lys 


l nr 


Aia 


LI ■» A 

nlS 


T 1 A 

He 


Asp 




510 










olo 




Cys 


Gly 


Asp 


Asp 


Asn 


vai 


Cys 


Asn 


Ser 


625 


















Phe 


Cys 


Thr 


Arg 


Glu 


Gly 


Asn 


Gin 


Asp 










645 








Gin 


Lys 


Gly 


Val 


Pro 


Glu 


Leu 


Val 


Leu 








660 










665 


Leu 


Glu 


He 


Thr 


Val 


Thr 


Asn 


Ser 


Pro 






675 










680 




Lys 


Asp 


Gly 


Asp 


Asp 


Ala 


His 


Glu 


Ala 



690 695 
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Va 1 

v ax 


blU 


Gin 
220 


Lys 


Asn 


Asn 


Thr 


vjxy 


D y a 

235 


Tyr 


bXU 


vai 


Gly 


Gly 
240 


rro 


vai 


Pro 


Aia 


Asn 


Ser 


Tyr 


250 










255 


bxy 


T1 - 

lie 


\1 A 1 

val 


C A -A 

Ser 


T « « A 

Lys 
o *7 n 


Asp 


Glu 


Ala 


TV A A 

Asn 


His 


Ser 

o o c 

285 


Gly 


Ala 


Val 


Aia 


HIS 


Leu 
300 


Leu 


Pro 


Glu 


His 


Ser 


Phe 
315 


Gly 


Tyr 


Asp 


Val 


Ala 
320 


Gin 


Asp 


He 


Val 


He 


Gly 


Ala 


330 










335 




Val 


Gly 


Gly 


Ala 


Val 


Tyr 


Val 










a r- a 

350 




Asn 


Val 


Lys 


Pro 
365 


He 


Arg 


Leu 


He 


Ala 


Val 


Lys 


Asn 


He 


Gly 






380 








He 


Ala 

"5 rt c 

395 


Val 


Gly 


Ala 


Pro 


Tyr 
400 


His 


Gly 


Ser 


Ala 


Asn 


Gly 


He 


410 










415 




Gly 


He 


Ser 


Pro 


Tyr 
430 


Phe 


Gly 


Asp 


Arg 


Asn 


Ser 
445 


Tyr 


Pro 


Asp 


Val 


Thr 


lie 


Phe 


Arg 


Ser 


Arg 






460 








Thr 


Val 


Thr 


Pro 


Asn 


Arg 


He 




475 








480 


Ala 


Pro 


Ser 


Gly 


lie 


Cys 


Leu 


490 










495 




Ala 


Asn 


Pro 


Ala 


Gly 
510 


Tyr 


Asn 


Glu 


Ala 


Glu 


Lys 
525 


Glu 


Arg 


Arg 


Phe 


Arg 


Asn 


Gin 


Gly 


Ser 


Glu 






540 








Lys 


Arg 

555 


Gin 


Lys 


Gin 


Lys 


Val 
560 


Asp 


Asn 


He 


Arg 


Asp 


Lys 


Leu 


570 










575 




Glu 


He 


Gin 


Glu 


Pro 
590 


Ser 


Ser 


val 


T A.-. 

Leu 


Pro 


He 
605 


Leu 


Asn 


Ser 


vai 


nlS 


Pne 


T A, « 

Leu 


Lys 


Glu 


Gly 






620 








7\ A A 

Asn 


T A t 

Leu 
635 


Lys 


Leu 


Glu 


Tyr 


Lys 
640 


r « • a 

Lys 


Phe 


Ser 


Tyr 


Leu 


Pro 


He 


650 










655 




Lys 


Asp 


Gin 


Lys 


Asp 
670 


He 


Ala 


Ser 


Asn 


Pro 


Arg 
685 


Asn 


Pro 


Thr 


Lys 


Leu 


He 

.700 


Ala 


Thr 


Phe 


Pro 
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Asp Thr 


Leu 


Thr 




Ser 


Ala 


Tur- 


Aro 
Mxy 


Glu 


X4CU 


arg 


Ala 


Pho 




Gl n 
\JX u 












710 










715 








7 °n 




gi n 


Leu 


Ser 




Val 


Ala 


A cn 


Gin 


Asn 


Gl u 
wxy 


C a v- 


Gin 

VJX 1 1 


Al a 
MX a 


Sen 
Mop 


\— y o 










725 










730 










7 7 5 


Glu 


Leu 


Gly 


Asn 


Pro 


Phe 


TiV/ C 

juy o 


ni y 


Sen 

M O 1 1 


Ser 


/^O ( 1 


Va 1 
v a X 


Thr 

X (IX 


Pho 
rne 


iyr 


Leu 








740 










745 










750 




Va 1 

v a X 


XjS U 


Car 


Thr 


Thr 

A « »x 


Gin 


Va 1 
v a x 


Th r 
1 nx 


Php 

CI 1 c 


Sen 

nop 


Thr 

X fix 


Prr» 
CIO 


iyr 


Leu 


Asp 


T 1 o 

xxe 






755 










7 










76R 






Asn 


Leu 


Ti \7 c 

-u y o 


T i on 


Gl n 

w -L Li 


Thr 


Thr 
li lx 


q p r 


Sen 
MOll 


Gin 


Sen 
Mo U 


Asn 


Leu 


Ala 
nld 


cro 


xxe 




77H 










775 








780 










Thr 


Ala 


uy o 


Ala 
MXd 


Xjy o 


Va 1 
v ax 


Va 1 


T I o 

xxe 


bJLU 


Leu 


Leu 


Leu 




Va 1 

va X 


Cor 

oer 


bxy 


7 Q q 










790 










795 










800 


v dX 


Al a 
Mia 


Lys 


D yr\ 
CX O 


Q ^ v 
oe x 

fins 


vaXU 


Va 1 

vax 


Tyr 


DK a 

c ne 


L>xy 

0 xu 


Gl \r 

oxy 


TK >- 
xnr 


vax 


v ax. 


uiy 

010 


bXU 


Gin 


Ala 


Ma h 


Lys 






Asp 


uxU 


Va 1 

vax 


v?xy 


Ser 


Leu 


ixe 


bxU 


Tyr 


blU 








820 










825 










830 




Phe 


Arg 


vol 

O J .J 


T 1 p 
XXC 


Asn 


Leu 


mi \/ 
biy 


Lys 
ft d n 


Pro 


Leu 


x nr 


Asn 


Leu 

O'lj 


bxy 


TK v 

inr 


Aia 


Thr 


Leu 
850 


A c n 

mo n 




oxn 


irp 


Pro 
855 


Lys 


gi ii 

bJLU 


Tl o 

Xxe 


Ser 


Asn 
860 


biy 


Lys 


Trp 


Leu 


Leu 


Tyr 


iJcu 


Val 


Lys 


Va 1 
v ax 


uXU 


S er 


Lys 


mi \/ 


Leu 


uxU 


Lys 


vax 


TK v 

xnr 


Lys 


365 










R7 n 










D / J 








880 


Glu 


Pro 


Gl n 


T 

Lys 


1 t-i 

OO J 


Tl a 
X X c 


Moll 


oex 


Leu 


Asn 
890 


Leu 


x nr 


bXU 


oer 


rixS 


Asn 


Ser Arg 


xjy o 


T.\/ c 

x»y j 


TV y-r«r 

nl y 


Gin 


Xxe 


Thr 

xnx 


bXU 


jjy o 


Mi n 
uxn 


T 1 e» 

x xe 


Asp 


Asp 


Asn 


Arg 








Qnn 




















qi n 






Lys 


Phe 


OCX 




tile 


a 1 a 


gi n 


Arg 

V 


Lys 


Tyr 


OXU 


x nr 


Leu 


Asn 


Cys 


Ser 


Val 


Asn 


val 


noil 




Va 1 
v ax 


A.sn 


Tie 

xxe 


Arg 


y s 


Pro 


Leu 


Arg 


ri t/ 
bxy 


Leu 


Asp 




930 




















q a n 








Ser 


Lys 


Ala 
MJLa 




T 

Leu 


xxe 


Leu 


Arg 


Ser 


Arg 


Leu 


Trp 


Asn 


Ser 


TK v 

xnr 


OK /-s 

rne 


945 
























960 


Leu 


Glu 


Glu 


Tyr 


oer 


Lys 


Leu 


Asn 


Tyr 


Leu 


Asp 


He 


Leu 


Met 


Arg 


Ala 




















Q7 n 








975 




Phe 


He 


Asp 


Val 
v ax 


Th r 
X fix 


Ala 


Ala 


Ala 


Glu 


Asn 


He 


Arg 


Leu 


Pro 


Asn 


a 1 a 

Ai.a 








980 

> u w 








985 








990 






Gly Thr 


Gin 


Val 


Arg 


Val 


Thr 


Val 


Phe 


Pro 


Ser 


Lys 


Thr 


Val 


Ala 


Gin 






995 










1000 








1005 






Tyr 


Ser 


Gly 


Val 


Pro 


Trp 


Trp 


He 


He 


Leu 


Val 


Ala 


lie 


Leu 


Ala 


Gly 




1010 








1015 








1020 






He 


Leu 


Met 


Leu 


Ala 


Leu 


Leu 


Val 


Phe 


He 


Leu 


Trp Lys 


Cys 


Gly 


Phe 


1025 








1030 








1035 




1040 


Phe 


Lys 


Arg 


Asn 


Lys 


Lys Asp 


His 


Tyr 


Asp 


Ala 


Thr 


Tyr 


His 


Lys 


Ala 










1045 








1050 








1055 


Glu 


He 


His 


Ala 


Gin 


Pro Ser Asp 


Lys 


Glu 


Arg 


Leu 


Thr 


Ser 


Asp 

) 


Ala 








1060 








1065 








107( 





<210> 3 
<211> 3222 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Mote = 
synthetic construct 

<400> 3 

atggccgtcg egggecagtt gtgcctgccc tacctgtccg eggggcttet agcccggctg 60 

ggtacagect tcaacctgga cacccgcgag gacaacgtga teeggaaate gggggatccc 120 

gggagectet tcggcttctc gctcgccatg cactggcagt tgcagccgga ggacaagegg ISO 

ctgttgcttg tgggggcacc tegggcagaa gcactcccgc tgeagaggge gaacagaaca 240 

gggggcctgt acagctgtga catcacctcc cgaggacctt gtacaeggat tgaatttgac 300 
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aatgacgctg atcctabgtc agaaagcaag gaagaccagb ggatgggagt cacbgtccag 360 

agccaaggtc cagggggcaa agtggtgacg tgtgcacatc gatatgagaa acggcagcac 420 

gtcaacacga agcaggagtc gcgggatatc tttggaagat gttatgtcct gagtcagaat 480 

ctcagaattg aagatgatat ggacggagga gactggagtt tctgcgatgg ccggttgaga 540 

ggccatgaaa agtttggctc ctgtcagcaa ggagtagcgg ctactttcac taaggacttt 600 

cattacattg tttttggagc cccagggact tacaactgga aagggatcgt ccgtgtagaa 660 

caaaagaata acactttttt fcgacatgaac abctttgaag atgggcccta tgaagttggt 720 

ggagagacag atcatgatga aagtctcgtg cccgttcctg ctaacagtta cctaggcttt 780 

tcgctggact cagggaaggg tattgtttct aaagatgaca tcacttttgt gbcbggbgct 840 

ccaagagcca atcacagtgg ggctgtagtg ttgctaaaaa gagacatgaa gtccgcacat 900 

ctgctccctg agtatatatt tgacggagaa ggcctggctt cctcgtttgg ctatgatctg 960 

gcagtggtgg acctcaatgc agatgggtgg caagacatcg ttatcggagc tccacagtat 1020 

bttgataggg atggtgaagt cgggggtgca gtttacgtct acattaacca gcaaggcaaa 1080 

tggagtaatg tgaagccgat tcgtctaaat gggaccaaag actcgatgtt tggaatctct 1140 

gtgaaaaata taggtgatat taaccaagat ggctatccag atattgctgt tggagctccc 1200 

tatgatgatc tggggaaggt ttttatctat catggatccc cgactggcat aattaccaag 12 60 

ccaacacagg tbctcgaggg gacatcgcct tacttcggct attcaatcgc tgggaatatg 1320 

gacctggatc ggaattccta ccccgacctt gctgtgggct ccctctcaga ctcggtcact 1380 

atfcttcagat cccggccagt gattaacatt ctaaaaacca tcacagtgac tcctaacaga 1440 

attgacctcc gccagaagtc catgtgtggc tcacctagcg ggatatgcct caaggttaaa 1500 

gcctgttttg aatatactgc gaaaccttcc ggtfcataacc ctccaatatc aattttgggt 1560 

attcttgaag ctgaaaaaga aagaagaaaa tcagggttgt catcgagagt tcagtttcga 1620 

aaccaaggtb ccgagccaaa gtatactcag gagctgaccc tgaatcggca gaagcagcgg 1680 

gcgtgcatgg aggagaccct ctggctgcag gagaacatca gagacaagct gcgtcccatc 1740 

cccatcacgg cttctgtgga gatccaggag cccacgtctc gccggcgggt gaactcactc 1800 

cccgaagttc ttcccatcct gaattcaaat gaagccaaaa cggtccagac agatgtccac 1860 

ttcttaaagg aaggatgtgg agacgacaat gtctgtaaca gcaaccttaa gctagagtat 1920 

aaatttggta cccgagaagg aaatcaagac aaattctctt accttccaat tcaaaaaggc 1980 

atcccagaat tagtcctaaa agatcagaaa gatatagctc tggaaataac ggtgaccaac 204 0 

agcccttcgg abccaaggaa tccccggaaa gatggcgacg atgcccatga agccaaactc 2100 

atcgccacgt ttccagacac tctgacatat tccgcttaca gagaactgag ggctttccct 2160 

gagaagcagc tgagctgtgt ggccaaccag aatggctccc aagccgactg tgagctcgga 2220 

aatcctttca agagaaattc cagtgttact ttctatctga ttttaagtac aaccgaggtc 2280 

acctttgaca ccacagatct ggatattaat ctgaagttgg aaacaacaag caatcaggat 2340 

aaattggctc caattacagc gaaggcaaaa gtggttattg aattgctttt atccctctcc 2400 

ggagtcgcta agccttcgca ggtgtatttt ggaggtacag ttgttggtga gcaagcfcatg 24 60 

aaatctgaag atgaagtagg aagtttaata gagtatgaat ttagggtgat taacttaggc 2520 

aagccfcctta aaaacctcgg cacagcaacc ttgaatatac agtggcccaa ggagattagc 2580 

aatggcaaat ggttgcttta tttgatgaaa gttgaatcca aaggtttgga gcagattgfcfc 2640 

tgtgagccac acaatgaaat aaactacctg aagctgaagg agtctcacaa ctcaagaaag 2700 

aaacgggaac ttcctgaaaa acagatagat gacagcagga aattttcttt atttcctgaa 2760 

agaaaatacc agactctcaa ctgcagcgtc aacgtcaggt gtgtgaacat caggtgccca 2820 

ctgcgagggc tggacacgaa ggcctctctc gttctgtgtt ccaggttgtg gaacagcaca 2880 

tttctagagg aatattccaa actgaactac fcfcggacattc fccgtgagggc bfcccatagat 294 0 

gbcaccgcbg cbgcbcagaa tabcaagcfcc ccbcacgcgg gcacbcaggt tcgagbgacg 3000 

gbgbbbccct caaagacbgt agctcagtab bcaggagbag cttggbggab catcctccbg 3060 

gctgbbcbbg ccgggabtct gabgctggcb cbabbagbgt bbtbactgtg gaagbgtggc 3120 

bbctbcaaga gaaabaagaa agatcabbac gabgccaccb atcacaaggc bgagabccab 3180 

acbcagccgb cbgabaaaga gaggctbacb bccgatgcat ag 3222 

<210> 4 
<211> 1073 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 4 

Met Ala Val Ala Gly Gin Leu Cys Leu Leu Tyr Leu Ser Ala Gly Leu 
1 5 10 .15 
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Leu 


Ala 


Ara 


Leu 








20 


Val 


He 


Arg 


Lys 






35 




Ala 


Met 


His 


Trp 




50 






Glv 


Ala 


pro 


Arg 


65 








Glv 


Gly 


xic u 


iyi 


Tl p 

x x c 


Gl i> 

ox u 


Pho 
rue 


Asp 








100 


Gin 


Trp 


Met 


Gly 






115 

X X *J 




Val 

v d X 


Thr 

X l IX 




Ala 
rid. a 




130 

X .J \J 






Gin 


Glu 


OCX 


Arg 


145 








TiO n 
tic u 


A r rr 
rix y 


Tip 
lie 


Gl 1 1 

ox u 


g i v 


Arg 


Leu 


Arg 








180 


Ala 


Ala 


Thr 


Phe 






1 




Gl v 


X I lx 


Tyr 


Asn 




?i n 

X \J 






1 ul 


pho 

r ne 


rjie 


Asp 


2 2 5 

ii c J 








Gl v 


gi n 

V3XU 


Thr 
l nx 


Sen 


i yx 


Leu 


gi v 


rue 








260 


Asp 


He 


Thr 


Phe 






275 




V CI X 


Val 
vox 


lit; LI 


IfC U 




<i ^ \j 






i yr 


Tl o 


Pho 


21 e r"i 
Map 










a l ^ 


Va 1 


vai 


Asp 


7\ "l -a 


rxO 


bin 


Tyr 








340 


Val 


Tyr 


He 


Asn 






7 5 5 

3 0 0 




iJCU 


A c r\ 


Gl \7 

biy 


Thr 

inr 




370 






nop 


Tl o 


lien 


? S 

JO J 








T w v 

lyr 




Asp 


L6U 


Tip 
xlc 


Tl o 
lie 


Thr 

l nr 


Lys 








420 


Gly 


Tyr 


Ser 


lie 






4 ?5 




Asp 


Leu 


Ala 

Mia 


^ ai 




450 






Arg 


Pro 


Val 


He 


465 








lie 


Asp 


Leu 


Arg 


Leu 


Lys 


Val 


Lys 








500 



Glv 


Thr 


Ala 


Phe 


Ser 

<J ex 


Gl V 


Asp 


Pro 








40 


Gin 


Leu 


Gin 


Pro 










Ala 




Al a 


T.pn 
xi c u 




70 






^or 




Asp 


Tl o 

lie 


85 








Sen 


Asp 


Ala 

ai a 


Asp 


vai 


mr 


vai 


bin 








120 


His 


Arg 


Tyr 


Glu 






IOJ 




Asp 


Tl o 

ne 


rne 


biy 




i ^n 






Asp 


Asp 


Mo f- 

wee 


Asp 


1 6^ 








biy 


nlS 


blU 


Lys 


inr 


Lys 


ASp 


Pne 








200 


Trp 


Lys 


Gly 


He 










L v ie u 


Asn 


T 1 o 

lie 


OK ta 

rne 










nib 


Asp 


blu 




?4 5 

t. *a J 








OCX 


Leu 


21 cn 


Ser 




Ser 


CI 17 


Al a 

Mia 








280 


Lys 


Arg 


Asp 


Met 










Pi tr 

biy 


pi ii 
blU 


biy 


Leu 




■Jin 
JlU 






Leu 


Asn 


Ala 


ASp 


OZ 0 






Ok- 

rne 


Asp 


Arg 


Asp 


Gin 


Gin 


Gly 


Lys 








360 


Lys 


Asp 


Ser 


Met 






J / o 




pi n. 

bin 


Asp 


p.1 \/ 


Tyr 










Gly 


Lys 


val 


Phe 


4 lO 








Pro 


inr 


bin 


val 


ax a 


biy 


Asn 


Kyi _ J_ 

Met 








4 4 n 

4 4 1/ 


Gly 


Ser 


Leu 


Ser 






455 




Asn 


He 


Leu 


Lys 




470 




Gin 


Lys 


Ser 


Met 


485 








Ala 


Cys 


Phe 


Glu 



6 



Asn 


Leu 


nap 


Thr 
1 nx 


25 






Glv 


Ser 


TtO 1 1 


Pho 


Glu 


Asp 




A n 
r\x \j 








60 


Pro 


Leu 


Gin 


Arg 






75 




Thr 


OCX 


Arg 


Gl i/ 

biy 








Jr X O 


net 


Ser 


blU 


1 05 

IwJ 








Cat- 
OCX 


Gin 

bin 


biy 


Pro 


Lys 


Arg 


bin 


MIS 








140 


Arg 


Cys 


Tyr 


Val 






ICC 
10 0 




p.1 

biy 


biy 


Asp 


Trp 




1 70 

1 ' V 






Pho 

rne 


biy 


Ser 


Cys 


10 0 








riiS 


Tyr 


lie 


vai 


\7=t 1 

vai 


Arg 


vai 


blU 








220 


Glu 


Asp 


Gly 


Pro 






ZOO 




Leu 


v ai 


Pro 


v ai 










Gl V7 

biy 


Lys 


Cl w 

biy 


T 1 o 

i ie 


2 65 








Pro 


Arg 


Mia 


Asn 


Lys 


Ser 


Mia 


nlS 








300 


Ala 


Ser 


Ser 


Phe 






010 




Gl w 

biy 


Trp 


bin 


Asp 




7 ^n 

0 JU 






biy 


blU 


vai 


pi „ 
biy 


O 4 o 








Trp 


Ser 


Asn 


val 


Pho 

trne 


biy 


lie 


ber 








380 


Pro 


Asp 


He 


Ala 






0 20 




Tl o 

lie 


Tyr 


WIS 


biy 




4 1 n 

ft 1 u 






T.o\i 


blU 


biy 


i nr 


4 ?5 

H J 






MSp 


Leu 


ASp 


Arg 


Asp 


Ser 


Val 


Thr 








4 60 


Thr 


He 


Thr 


Val 






475 




Cys 


Gly 


Ser 


Pro 




490 






Tyr 


Thr 


Ala 


Lys 



505 



Arg 


Glu 


lien 


Asn 




30 






Glv 

O X y 


Php 


^or 

OCX 


Leu 


4 5 








T >o i i 
XlC u 


lie U 


Leu 


vai 


A 1 ^ 

Mia 


As n 


Arg 


inr 








80 


Pro 


Cys 


Thr 


Arg 






y 0 


Ser 


Lys 


Glu 


Asp 




i i n 

1 1 u 






Gly 


p T t , 

Gly 


Lys 


Val 


lzo 








Val 


Asn 


Thr 


Lys 


Leu 


Ser 


Gin 


Asn 








160 


Ser 


Phe 


Cys 


Asp 






i/O 




bin 


P 1 M 

Gin 


Gly 


Val 




lyu 




nu _ 

krne 


pi • . 
Gly 


Ala 


Pro 


zuo 








p -I _ 

Gin 


Lys 


Asn 


Asn 


Tyr 


Glu 


Val 


Gly 








240 


Pro 


Ala 


Asn 


Ser 






ICC 

ZOO 




vai 


Ser 


Lys 


ASp 










IT ' _ 

MIS 


oer 


Gly 


Ala 


0 ^ 
ZOO 








Leu 


Leu 


Pro 


blU 


Gly 


Tyr 


TV m _ 

Asp 


Val 








320 


He 


val 


lie 


Gly 






ooo 




p i < » 
Gly 


Ala 


Val 


Tyr 




ooU 






Lys 


Pro 


Tl- 
116 


Arg 


0 00 








val 


Lys 


Asn 


He 


val 


Gly 


Ala 


Pro 








400 


Ser 


Pro 


Thr 


Gly 






4 10 


Q y~v -V- 

oer 


fro 


Tyr 


rne 




4 ^n 

4 O V 






Asn 


C A v» 

ser 


Tyr 


Pro 


445 








He 


Phe 


Arg 


Ser 


Thr 


Pro 


Asn 


Arg 








430 


Ser 


Gly 


He 


Cys 






495 




Pro 


Ser 


Gly 


Tyr 



510 
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Asn 


Pro 


Pro 


He 






DID 




Arg Lys 


ber 


b±y 




530 






Glu 


Pro 


Lys 


Tyr 


545 








Ala 


Cys 


Met 


Glu 


Leu 


Arg 


Pro 


He 








580 


Ser Arg 


Arg 


Arg 






595 




Ser 


Asn 


Glu 


Ala 




610 






Gly Cys 


Gly 


Asp 


625 








Lys 


Phe 


Gly 


Thr 


He 


Gin 


Lys 


Gly 








660 


Ala 


Leu 


Glu 


He 






675 




Arg Lys 


Asp 


Gly 




690 






Pro Asp 


Thr 


Leu 


705 








Glu 


Lys 


Gin 


Leu 


Cys 


Glu 


Leu 


Gly 








740 


Leu 


He 


Leu 


Ser 






755 




He 


Asn 


Leu 


Lys 




770 






He 


Thr 


Ala 


Lys 


785 








Gly Val 


Ala 


Lys 


Glu 


Gin 


Ala 


Met 








820 


Glu 


Phe 


Arg 


Val 






835 




Ala 


Thr 


Leu 


Asn 




850 






Leu 


Leu 


Tyr 


Leu 


865 








Cys 


Glu 


Pro 


His 


Asn 


Ser 


Arg 


Lys 








900 


Arg 


Lys 


Phe 


Ser 






915 




Ser 


Val 


Asn 


Val 




930 






Asp 


Thr 


Lys 


Ala 


945 








Phe 


Leu 


Glu 


Glu 


Ala 


Ser 


He 


Asp 








980 


Ala 


Gly 


Thr 


Gin 






995 





Ser 


lie 


ireu 


Gly 








520 


Leu 


Ser 


Ser 


Arg 






535 




Thr 


Gin 


Glu 


Leu 




DDO 






Glu 


Thr 


Leu 


Trp 


^ a c; 
DDD 






Pro 


He 


Thr 


Ala 


Val 


Asn 


Ser 


Leu 








600 


Lys 


Thr 


val 


Gin 






615 




Asp 


Asn 


Val 


Cys 




630 






Arg 


Glu 


Gly 


Asn 


645 








He 


Pro 


Glu 


Leu 


Thr 


Val 


Thr 


Asn 








680 


Asp 


Asp 


Ala 


His 






695 




Thr 


Tyr 


Ser 


Ala 




710 






Ser 


Cys 


Val 


Ala 


725 








Asn 


Pro 


Phe 


Lys 


Thr 


Thr 


Glu 


Val 








760 


Leu 


Glu 


Thr 


Thr 






775 




Ala 


Lys 


.Val 


Val 




790 






Pro 


Ser 


Gin 


Val 


805 








Lys 


Ser 


Glu 


Asp 


He 


Asn 


Leu 


Gly 








840 


He 


Gin 


Trp 


Pro 






855 




Met 


Lys 


Val 


Glu 




870 






Asn 


Glu 


He 


Asn 


385 








Lys 


Arg 


Glu 


Leu 


Leu 


Phe 


Pro 


Glu 








920 


Arg 


Cys 


Val 


Asn 






y jo 




Ser 


Leu 


Val 


Leu 




950 






Tyr 


Ser 


Lys 


Leu 


965 








Val 


Thr 


Ala 


Ala 


Val 


Arg 


Val 


Thr 



1000 



7 



lie 


Leu 


b±U 


Ala 


val 


bin 


rue 


Arg 








540 


Thr 


Leu 


Asn 


Arg 






555 


Leu 


Gin 


Glu 


Asn 




1 i r> 

570 






Ser 


Val 


Glu 


He 


roc 

5 o 5 








Pro 


Glu 


Val 


Leu 


Thr 


Asp 


Val 


His 








620 


Asn 


Ser 


Asn 


Leu 






635 




Gin 


Asp 


Lys 


Phe 




650 






Val 


Leu 


Lys 


Asp 


665 








Ser 


Pro 


Ser 


Asp 


Glu 


Ala 


Lys 


Leu 








700 


Tyr 


Arg 


Glu 


Leu 






715 




Asn 


Gin 


Asn 


Gly 




730 






Arg 


Asn 


Ser 


Ser 


7 45 








Thr 


Phe 


Asp 


Thr 


Ser 


Asn 


Gin 


Asp 








780 


He 


Glu 


Leu 


Leu 






795 




Tyr 


Phe 


Gly 


Gly 




810 






Glu 


Val 


Gly 


Ser 


825 








Lys 


Pro 


Leu 


Lys 


Lys 


Glu 


He 


Ser 








860 


Ser 


Lys 


Gly 


Leu 






875 




Tyr 


Leu 


Lys 


Leu 




o rt t\ 

890 






Pro 


Glu 


Lys 


Gin 


905 








Arg 


Lys 


Tyr 


Gin 


He 


Arg 


Cys 


Pro 








940 


Cys 


Ser 


Arg 


Leu 






955 




Asn 


Tyr 


Leu 


Asp 




970 






Ala 


Gin 


Asn 


lie 


985 








Val 


Phe 


Pro 


Ser 



Glu 


Lys 


Glu 


Arg 


525 








Asn 


Gin 


Gly 


Ser 


Gin 


Lys 


Gin 


Arg 








560 


He 


Arg 


Asp 


Lys 






57 5 




Gin 


Glu 


Pro 


Thr 




590 






Pro 


He 


Leu 


Asn 


605 








Phe 


Leu 


Lys 


Glu 


Lys 


Leu 


Glu 


Tyr 








640 


Ser 


Tyr 


Leu 


Pro 




655 




Gin 


Lys 


Asp 


He 




670 






Pro 


Arg 


Asn 


Pro 


685 






He 


Ala 


Thr 


Phe 


Arg 


Ala 


Phe 


Pro 








720 


Ser 


Gin 


Ala 


Asp 






735 




Val 


Thr 


Phe 


Tyr 




750 






Thr 


Asp 


Leu 


Asp 


765 








Lys 


Leu 


Ala 


Pro 


Leu 


Ser 


Leu 


Ser 








800 


Thr 


Val 


Val 


Gly 






815 




Leu 


He 


Glu 


Tyr 




830 




Asn 


Leu 


Gly 


Thr 


845 








Asn 


Gly 


Lys 


Trp 


Glu 


Gin 


He 


Val 








880 


Lys 


Glu 


Ser 


His 






895 




He 


Asp 


Asp 


Ser 




910 






Thr 


Leu 


Asn 


Cys 


925 






Leu 


Arg 


Gly 


Leu 


Trp 


Asn 


Ser 


Thr 








960 


He 


Leu 


Val 


Arg 






975 




Lys 


Leu 


Pro 


His 




990 






Lys 


Thr 


Val 


Ala 



1005 
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Gin Tyr Ser Gly Val Ala Trp Trp lie lie Leu Leu Ala Val Leu Ala 



?he Phe Lys Arg Asn Lys Lys Asp His Tvr Asp Ala Thr Tyr His Lys 

1045 1050 1055 

Ala Glu lie His Thr Gin Pro Ser Asp Lys Glu Arg Leu Thr Ser Asp 
1060 1065 " 1070 

Ala 

<210> 5 
<211> 5622 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 5 

atggcagggc cacgccccag cccatgggcc aggctgctcc tggcagcctt gatcagcgtc 60 

agcctctctg ggaccttgaa ccgctgcaag aaggccccag tgaagagctg cacggagtgt 120 

gtccgtgtgg ataaggactg cgcctactgc acagacgaga tgttcaggga ccggcgctgc 180 

aacacccagg cggagctgct ggccgcgggc tgccagcggg agagcatcgt ggtcatggag 24 0 

agcagcttcc aaatcacaga ggagacccag attgacacca ccctgcggcg cagccagatg 300 

tccccccaag gcctgcgggt ccgtctgcgg cccggtgagg agcggcattt tgagctggag 3 60 

gtgtttgagc cactggagag ccccgtggac ctgtacatcc tcatggactt ctccaactcc 420 

atgtccgatg atctggacaa cctcaagaag atggggcaga acctggctcg ggtcctgagc 4 30 

cagctcacca gcgactacac tattggattt ggcaagtttg tggacaaagt cagcgtcccg 540 

cagacggaca tgaggcctga gaagctgaag gagccttggc ccaacagtga cccccccttc 600 

tccttcaaga acgtcatcag cctgacagaa gatgtggatg agttccggaa taaactgcag 660 

ggagagcgga tctcaggcaa cctggatgct cctgagggcg gcttcgatgc catcctgcag 720 

acagctgtgt gcacgaggga cattggctgg cgcccggaca gcacccacct gctggtcttc 780 

tccaccgagt cagccttcca ctafcgaggct gatggcgcca acgtgctggc tggcatcatg 840 

agccgcaacg atgaacggtg ccacctggac accacgggca cctacaccca gtacaggaca 900 

caggactacc cgtcggtgcc caccctggtg cgcctgctcg ccaagcacaa catcatcccc 960 

atctttgctg tcaccaacta ctcctatagc tactacgaga agcttcacac ctatttccct 1020 

gtctcctcac tgggggtgct gcaggaggac tcgtccaaca tcgtggagct gctggaggag 1080 

gccttcaatc ggatccgctc caacctggac atccgggccc tagacagccc ccgaggcctt 1140 

cggacagagg tcacctccaa gatgttccag aagacgagga ctgggtcctt tcacatccgg 1200 

cggggggaag tgggtatata ccaggtgcag ctgcgggccc ttgagcacgt ggatgggacg 12 60 

cacgtgtgcc agctgccgga ggaccagaag ggcaacatcc atctgaaacc ttccttctcc 1320 

gacggcctca agatggacgc gggcatcatc tgtgatgtgt gcacctgcga gctgcaaaaa 1380 

gaggtgcggt cagctcgctg cagcttcaac ggagacttcg tgtgcggaca gtgtgtgtgc 14 4 0 

agcgagggcfc ggagtggcca gacctgcaac tgctccaccg gctctctgag tgacattcag 1500 

ccctgcctgc gggagggcga ggacaagccg tgctccggcc gtggggagtg ccagtgcggg 1560 

cactgtgtgt gctacggcga aggccgctac gagggtcagt tctgcgagta tgacaacttc 1620 

cagtgtcccc gcacttccgg gttcctctgc aatgaccgag gacgctgctc catgggccag 1680 

tgtgtgtgtg agcctggttg gacaggccca agctgtgact gtcccctcag caatgccacc 1740 

tgcatcgaca gcaatggggg catctgtaat ggacgtggcc actgtgagtg tggccgctgc 1800 

cactgccacc agcagtcgct ctacacggac accatctgcg agatcaacta ctcggcgtcc 1860 

acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg ggcaccggcg 1920 

agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg gacgagctta 1980 

agagaggcga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac gactgcacct 2040 

acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc ctggtgcaca 2100 

agaagaaggg actgccctcc gggctccttc tggtggctca tccccctgct cctcctcctc 2160 

ctgccgctcc tggccctgct actgctgcta tgctggaagt actgtgcctg ctgcaaggcc 2220 

tgcctggcac ttctcccgtg ctgcaaccga ggtcacatgg tgggcttfcaa ggaagaccac 2280 

tacatgctgc gggagaacct gatggcctct caccacttgg acacgcccat gctgcgcagc 2340 

gggaacctca agggccgtga cgtggtccgc tggaaggtca ccaacaacat gcagcggcct 2400 

ggctttgcca ctcatgccgc cagcatcaac cccacagagc tggtgcccta cgggctgtcc 24 60 
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ttgcgcctgg cccgcctfctg caccgagaac ctgctgaagc ctgacactcg ggagtgcgcc 2520 

cagctgcgcc aggaggfcgga ggagaacctg aacgaggtcfc acaggcagat ctccggtgta 2580 

cacaagctcc agcagaccaa gttccggcag cagcccaatg ccgggaaaaa gcaagaccac 2640 

accattgtgg acacagtgct gatggcgccc cgctcggcca agccggccct gcfcgaagcti: 2700 

acagagaagc aggtggaaca gagggccttc cacgacctca aggtggcccc cggctactac 2760 

accctcactg cagaccagga cgcccggggc atggtggagt tccaggaggg cgtggagcfcg 2820 

gtggacgtac gggtgccccfc ctttacccgg cctgaggatg acgacgagaa gcagctgctg 2880 

gtggaggcca tcgacgtgcc cgcaggcact gccaccctcg gccgccgcct ggtaaacatc 2940 

accatcatca aggagcaagc cagagacgtg gtgtcctthg agcagcctga gctctcggtc 3000 

agccgcgggg accaggtggc ccgcatccct gtcatccggc gtgtcctgga cggcgggaag 3060 

tcccaggtct cctaccgcac acaggatggc accgcgcagg gcaaccggga ctacatcccc 3120 

gtggagggtg agctgctgtt ccagcctggg gaggcctgga aagagctgca ggtgaagctc 3180 

ctggagctgc aagaagttga ctccctcctg cggggccgcc aggtccgccg tttccacgcc 3240 

cagctcagca accctaagtt tggggcccac ctgggccagc cccactccac caccatcafcc 3300 

atcagggacc cagatgaact ggaccggagc ttcacgagtc agatgttgtc atcacagcca 3360 

ccccctcacg gcgacctggg cgccccgcag aaccccaatg ctaaggccgc tgggtccagg 3420 

aagatccatt tcaactggct gcccccttct ggcaagccaa tggggtacag ggtaaagtac 34 30 

tggattcagg gfcgactccga atccgaagcc cacctgctcg acagcaaggt gccctcagtg 354 0 

gagctcacca acctgtaccc gtattgcgac tatgagatga aggtgtgcgc ctacggggct 3600 

cagggcgagg gaccctacag ctccctggtg tcctgccgca cccaccagga agtgcccagc 3660 

gagccagggc gtctggcctt caatgtcgtc tcctccacgg tgacccagct gagctgggct 3720 

cagccggctg agaccaacgg tgagatcaca gcctacgagg tctgctatgg cctggtcaac 3780 

gatgacaacc gacctattgg gcccatgaag aaagtgctgg ttgacaaccc taagaaccgg 3840 

atgctgctta ttgagaacct tcgggagtcc cagccctacc gctacacggt gaaggcgcgc 3900 

aacggggccg gctgggggcc tgagcgggag gccatcatca acctggccac ccagcccaag 3960 

aggcccatgt ccatccccat catccctgac atccctatcg tggacgccca gagcggggag 4020 

gactacgaca gcttccttat gtacagcgat gacgttctac gctctccatc gggcagccag 4080 

aggcccagcg tctccgatga cactggctgc ggctggaagt tcgagcccct gctgggggag 4140 

gagctggacc tgcggcgcgt cacgtggcgg ctgcccccgg agctcatccc gcgcctgtcg 4200 

gccagcagcg ggcgctcctc cgacgccgag gcgccccacg ggcccccgga cgacggcggc 4260 

gcgggcggga agggcggcag cctgccccgc agtgcgacac ccgggccccc cggagagcac 4320 

ctggtgaatg gccggatgga ctttgccttc ccgggcagca ccaactccct gcacaggatg 4 380 

accacgacca gtgctgctgc ctatggcacc cacctgagcc cacacgtgcc ccaccgcgtg 4440 

ctaagcacat cctccaccct cacacgggac tacaactcac tgacccgctc agaacactca 4500 

cactcgacca cactgccgag ggaptactcc accctcacct ccgtctcctc ccacggcctc 4560 

cctcccatct gggaacacgg gaggagcagg cttccgctgt cctgggccct ggggtcccgg 4 620 

agtcgggctc agatgaaagg gttcccccct tccaggggcc cacgagactc fcataatcctg 4680 

gctgggaggc cagcagcgcc ctcctggggc ccagactctc gcctgactgc tggtgtgccc 4740 

gacacgccca cccgcctggt gttctctgcc ctggggccca catctctcag agtgagctgg 4800 

caggagccgc ggtgcgagcg gccgctgcag ggctacagtg tggagtacca gctgctgaac 48 60 

ggcggtgagc tgcatcggct caacatcccc aaccctgccc agacctcggt ggtggtggaa 4920 

gacctcctgc ccaaccactc ctacgtgttc cgcgtgcggg cccagagcca ggaaggctgg 4 980 

ggccgagagc gtgagggtgt catcaccatfc gaatcccagg tgcacccgca gagcccactg 504 0 

tgtcccctgc caggctccgc cttcactttg agcactccca gtgccccagg cccgctggtg 5100 

ttcactgccc tgagcccaga ctcgctgcag cfcgagctggg agcggccacg gaggcccaat 5160 

ggggatatcg tcggctacct ggtgacctgt gagatggccc aaggaggagg tccagccacc 5220 

gcattccggg tggatggaga cagccccgag agccggctga ccgtgccggg cctcagcgag 5280 

aacgtgccct acaagttcaa ggfcgcaggcc aggaccactg agggcttcgg gccagagcgc 5340 

gagggcatca tcaccataga gtcccaggat ggaggtccct tcccgcagct gggcagccgt 54 00 

gccgggcbct tccagcaccc gctgcaaagc gagtacagca gcatctccac cacccacacc 5460 

agcgccaccg agcccttcct agfcgggtccg accctggggg cccagcaccfc ggaggcaggc 5520 

ggctccctca cccggcatgt gacccaggag tttgtgagcc ggacactgac caccagcgga 5580 

acccttagca cccacatgga ccaacagttc ttccaaactt ga 5622 



<210> 6 
<211> 1873 
<212> ?RT 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial 
synthetic construct 



<400> 6 



wet 


Ala 


Giy 


rro 


Arg 


Pro 


Cam 

Ser 


Pro 


-i 
i 








3 








Leu 


lie 


t*» — — 
Ser 


t 7 mm 1 

Val 


Ser 


Leu 


Ser 


Gly 
















Pro 


va± 


Lys 


Ser 


Cys 


Thr 


Glu 


Cys 
















40 


Tyr 


Lys 


Thr 


Asp 


Glu 


Met 


Phe 


Arg 




3U 










55 


Glu 


Leu 


Leu 


Ala 


Ala 


Gly 


Cys 


Gin 


03 










70 




Ser 


Ser 


Phe 


Gin 


He 


Thr 


Glu 


Glu 










85 








Arg 


Ser 


Gin 


Met 


Ser 


Pro 


Gin 


Gly 








100 








GlU 


Glu 


Arg 


His 


Phe 


Glu 


Leu 


Glu 






lib 










120 


val 


Asp 


Leu 


Tyr 


He 


Leu 


Met 


Asp 














135 


Leu 


Asp 


Asn 


Leu 


Lys 


Lys 


Met 


Gly 












13Q 




uin 


Leu 


Tnr 


Ser 


Asp 


Tyr 


Thr 


He 










165 








val 


Ser 


\ 7 — 1 

Val 


Pro 


Gin 


Thr 


Asp 


Met 








1 Oft 

180 








Trp 


Pro 


Asn 


Ser 


Asp 


Pro 


Pro 


Phe 






193 










200 


Thr 


Glu 


Asp 


Val 


Asp 


Glu 


Phe 


Arg 




210 










215 


Ser 


Giy 


Asn 


Leu 


Asp 


Ala 


Pro 


Glu 


"i "1 c 

225 










230 






Thr 


Ala 


Val 


Cys 


Thr 


Arg 


A;sp 


He 










245 








Leu 


Leu 


Val 


Phe 


Ser 


Thr 


Glu 


Ser 








260 










Ala 


Asn 


Val 


Leu 


Ala 


Gly 


He 


Met 






275 








280 


Leu 


Asp 


Thr 


Thr 


Gly 


Thr 


Tyr 


Thr 




290 










295 




Ser 


Val 


Pro 


Thr 


Leu 


Val 


Arg 


Leu 


305 










310 




lie 


Phe 


Ala 


Val 


Thr 


Asn 


Tyr 


Ser 










325 






Thr 


Tyr 


Phe 


Pro 


Val 


Ser 


Ser 


Leu 








340 










Asn 


lie 


Val 


Glu 


Leu 


Leu 


Glu 


Glu 






355 










fin 


Leu 


Aso 


He 


Arg 


Ala 


Leu 


Asp 


Ser 




370 










375 




Thr 


Ser 


Lys 


Met 


Phe 


Gin 


Lys 


Thr 


385 










390 




Arg 


Gly 


Glu 


Val 


Gly 


He 


Tyr 


Gin 










405 








Val 


Asp 


Gly 


Thr 


His 


val 


Cys 


Gin 








420 








He 


His 


Leu 


Lys 


Pro 


Ser 


Phe 


Ser 






435 










440 


He 


He 


Cys 


Asp 


Val 


Cys 


Thr 


Cys 



450 455 



10 

Sequence; Note = 



Trp 


Ala 


Arg 


Leu Leu Leu 


Ala 


Ala 




10 






15 




Thr 


Leu 


Asn 


Arg Cys Lys 


Lys Ala 


25 






30 






Val 


Arg 


Val 


Asp Lys Asp 
45 


Cys 


Ala 


Asp 


Arg 


Arg 


Cys Asn Thr 


Gin 


Ala 








60 






Arg 


Glu 


Ser 


He Val Val 


Met 


Glu 






75 






80 


Thr 


Gin 


He 


Asp Thr Thr 


Leu Arg 




90 






95 




Leu 


Arg 


Val 


Arg Leu Arg 


Pro 


Gly 


105 






110 






Val 


Phe 


Glu 


Pro Leu Glu 


Ser 


Pro 








125 






Phe 


Ser 


Asn 


Ser Met Ser 


Asp Asp 








140 






Gin 


Asn 


Leu 


Ala Arg Val 


Leu 


Ser 






155 






160 


Gly 


Phe 


Gly 


Lys Phe Val 


Asp Lys 




170 






175 




Arg 


Pro 


Glu 


Lys Leu Lys 


Glu 


Pro 


IOC 

183 






190 






Ser 


Phe 


Lys 


Asn Val He 


Ser 


Leu 








205 






Asn 


Lys 


Leu 


Gin Gly Glu 


Arg 


He 








220 




Gly 


Gly 


Phe 


Asp Ala He 


Leu 


Gin 






235 






240 


Gly 


Trp 


Arg 


Pro Asp Ser 


Thr 


His 




250 






255 




Ala 


Phe 


His 


Tyr Glu Ala 


Asp Gly 


265 






270 






Ser 


Arg 


Asn 


Asp Glu Arg 


Cys 


His 








285 






Gin 


Tyr 


Arg 


Thr Gin Asp 


Tyr 


Pro 








300 






Leu 


Ala 


Lys 


His Asn He 


He 


Pro 






315 






320 


Tyr 


Ser 


Tyr 


Tyr Glu Lys 


Leu 


His 




330 






335 




Gly 


Val 


Leu 


Gin Glu Asp 


Ser 


Ser 


34 5 






350 






Ala 


Phe 


Asn 


Arg He Arg 


Ser 


Asn 








■see 






Pro 


Arg 


Gly 


Leu Arg Thr 


Glu 


Val 








380 






Arg 


Thr 


Gly 


Ser Phe His 


lie Arg 






395 






400 


Val 


Gin 


Leu 


Arg Ala Leu 


Glu 


His 




410 




415 




Leu 


Pro 


Glu 


Asp Gin Lys 


Gly Asn 


425 






430 






Asp 


Gly 


Leu 


Lys Met Asp 


Ala 


Gly 








445 






Glu 


Leu 


Gin 


Lys Glu Val 


Arg 


Ser 



460 
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Ala 


Arg 


Cys 


Ser 


465 








Ser 


Glu 


Gly 


Tro 


Ser 


Asp 


He 


Gin 








500 


Gly Arg 


Gly 


Glu 






J X V 




Arg 


Tyr 


Glu 






530 






Thr 


Ser 




XT J le 


545 






Cys 


Val 


Cys 




Ser 


Asn 


Ala 


Thr 








580 


Gly 


His 


Cys 


Glu 






595 




Thr Asp 


Thr 


He 




610 






Ala Arg 


Thr 


Tx/T" 


625 






Arg Arg 


Arg 




Trp 


Thr 


Car 










660 


Gly Thr 


Arg 


Met 






675 




Ala 


Pro 


Leu 


Gly 




690 






Cys 


Pro 


Pro 




705 






Leu 


Pro 




DC Li 


Cys 


Cys 


- 

Lrys 


Al a 
nla 








740 


Met 


Val 


Gly 


Phe 






755 




Ala 


Ser 


A <?n 
nop 


Mi q 




770 






Gly Arg 




v a z 


785 








Gly 


Phe 


Al a 


Thr 


Tyr Gly 


Leu 


Car 








820 


Lys 


Pro 


Asp 


Thr 






835 




Asn 


Leu 


A 






850 






Gin 


Thr 




Phe 
cite 


365 






Thr 


He 


Val 

vox 


A 


Leu 


Leu 




TfPi > 

JUC u 








900 


Leu 


Lys 


Val 


Ala 






915 




Arg 


Gly 


Met 


Val 




930 






val 


Pro 


Leu 


Phe 


945 









Phe Asn 
470 
Ser Gly 
485 

Pro Cys 

Cys Gin 

Gin Phe 

Leu Cys 
550 

Pro Gly 
565 

Cys He 

Cys Gly 

Cys Glu 

Ala Pro 
630 
Ala Arg 
645 

Arg Glu 

Thr Thr 

Pro Thr 

Ser Phe 
710 
Ala Leu 
725 

Cys Leu 

Lys Glu 

Leu Asp 

Val Arg 
790 
His Ala 
805 

Leu Arg 

Arg Glu 

Val Tyr 

Arg Gin 
870 
Thr Val 
885 

Thr Glu 

Pro Gly 

Glu Phe 

He Arg 
950 



Gly Asp 

Gin Thr 

Leu Arg 

Cys Gly 
520 
Cys Glu 
535 

Asn Asp 

Trp Thr 

Asp Ser 

Arg Cys 
600 
He Asn 
615 

Ala Cys 

Val Arg 

Ala Arg 

Ala Pro 
680 
Ala Leu 
695 

Trp Trp 

Leu Leu 

Ala Leu 

Asp His 
760 
Thr Pro 
775 

Trp Lys 

Ala Ser 

Leu Ala 

Cys Ala 
840 
Arg Gin 
855 

Gin Pro 

Leu Met 

Lys Gin 

Tyr Tyr 
920 
Gin Glu 
935 

Pro Glu 



Phe Val Cys 
475 

Cys Asn Cys 
490 

Glu Gly Glu 
505 

His Cys Val 

Tyr Asp Asn 

Arg Gly Arg 
555 

Gly Pro Ser 

570 
Asn Gly Gly 
585 

His Cys His 

Tyr Ser Ala 

Ser Ala Arg 
635 

Asn Ala Thr 
650 

Arg Trp Trp 
665 

Thr Ala Thr 

Ser Trp Cys 

Leu He Pro 
715 

Leu Leu Cys 

730 
Leu Pro Cys 
745 

Tyr Met Leu 

Met Leu Arg 

Val Thr Asn 
795 

He Asn Pro 

810 
Arg Leu Cys 
825 

Gin Leu Arg 

lie Ser Gly 

Asn Ala Gly 
875 

Ala Pro Arg 

890 
Val Glu Gin 
905 

Thr Leu Thr 

Gly Val Glu 

Asp Asp Asp 
955 



Gly Gin Cys 

Ser Thr Gly 

Asp Lys Pro 
510 

Cys Tyr Gly 

525 
Phe Gin Cys 
540 

Cys Ser Met 

Cys Asp Cys 

He Cys Asn 
590 

Gin Gin Ser 
605 

Ser Thr Arg 
620 

Arg Gly Ala 

Ser Arg Ser 

Cys Ala Ala 
670 

Pro Trp Lys 
685 

Thr Arg Arg 
700 

Leu Leu Leu 

Trp Lys Tyr 

Cys Asn Arg 
750 

Arg Glu Asn 

765 
Ser Gly Asn 
780 

Asn Met Gin 

Thr Glu Leu 

Thr Glu Asn 
830 

Gin Glu Val 

845 
Val His Lys 
860 

Lys Lys Gin 

Ser Ala Lys 

Arg Ala Phe. 

910 

Ala Asp Gin 

925 
Leu Val Asp 
940 

Glu Lys Gin 



Val Cys 
480 
Ser Leu 
495 

Cys Ser 

Glu Gly 

Pro Arg 

Gly Gin 
560 
Pro Leu 
575 

Gly Arg 

Leu Tyr 

Ala Ser 

Pro Ala 
640 
Arg Trp 
655 

Pro Ser 

Val Thr 

Arg Asp 

Leu Leu 
720 
Cys Ala 
735 

Gly His 

Leu Met 

Leu Lys 

Arg Pro 
800 
Val Pro 
815 

Leu Leu 

Glu Glu 

Leu Gin 

Asp His 
380 
Pro Ala 
895 

His Asp 

Asp Ala 

Val Arg 

Leu Leu 
960 
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Val Glu Ala He Asp Val Pro Ala Gly Thr Ala Thr Leu Gly Arg Arg 

965 970 975 

Leu Val Asn He Thr He He Lys Glu Gin Ala Arg Asp Val Val Ser 

980 985 990 

Phe Glu Gin Pro Glu Phe Ser Val Ser Arg Gly Asp Gin Val Ala Arg 

995 1000 1005 

He Pro Val He Arg Arg Val Leu Asp Gly Gly Lys Ser Gin Val Ser 

1010 1015 1020 

Tyr Arg Thr Gin Asp Gly Thr Ala Gin Gly Asn Arg Asp Tyr He Pro 
1025 1030 1035 " " 1040 

Val Glu Gly Glu Leu Leu Phe Gin Pro Gly Glu Ala Tro Lys Glu Leu 

1045 1050 " 1055 

Gin Val Lys Leu Leu Glu Leu Gin Glu Val Asp Ser Leu Leu Arg Gly 

1060 1065 1070 

Arg Gin Val Arg Arg Phe His Val Gin Leu Ser Asn Pro Lys Phe Gly 

1075 1080 1085 

Ala His Leu Gly Gin Pro His Ser Thr Thr He He He Arg Asp Pro 

1090 1095 1100 

Asp Glu Leu Asp Arg Ser Phe Thr Ser Gin Met Leu Ser Ser Gin Pro 
1105 HlO 1115 1120 

Pro Pro His Gly Asp Leu Gly Ala Pro Gin Asn Pro Asn Ala Lys Ala 

1125 1130 1135 

Ala Gly Ser Arg Lys He His Phe Asn Trp Leu Pro Pro Ser Gly Lys 

1140 1145 1150 

Pro Met Gly Tyr Arg Val Lys Tyr Trp He Gin Gly Asp Ser Glu Ser 

1155 1160 1165 

Glu Ala His Leu Leu Asp Ser Lys Val Pro Ser Val Glu Leu Thr Asn 

1170 1175 1180 

Leu Tyr Pro Tyr Cys Asp Tyr Glu Met Lys Val Cys Ala Tyr Gly Ala 
1185 1190 1195 * 1200 

Gin Gly Glu Gly Pro Tyr Ser Ser Leu Val Ser Cys Arg Thr His Gin 

1205 1210 " 1215 

Glu Val Pro Ser Glu Pro Gly Arg Leu Ala Phe Asn Val Val Ser Ser 

1220 1225 1230 

Thr Val Thr Gin Leu Ser Trp Ala Glu Pro Ala Glu Thr Asn Gly Glu 

1235 1240 1245 

He Thr Ala Tyr Glu Val Cys Tyr Gly Leu Val Asn Asp Asp Asn Arg 

1250 1255 1260 

Pro He Gly Pro Met Lys Lys Val Leu Val Asp Asn Pro Lys Asn Arg 
1265 1270 1275 1280 

Met Leu Leu He Glu Asn Leu Arg Glu Ser Gin Pro Tyr Arg Tyr Thr 

1285 1290 " 1295 

Val Lys Ala Arg Asn Gly Ala Gly Trp Gly Pro Glu Arg Glu Ala He 

1300 1305 1310 

Tie Asn Leu Ala Thr Gin Pro Lys Arg Pro Met Ser He Pro He lie 

1315 1320 1325 

Pro Asp Tie Pro lie Val Asp Ala Gin Ser Gly Glu Asp Tyr Asp Ser 

1330 1335 1340 

Phe Leu Met Tyr Ser Asp Asp Val Leu Arg Ser Pro Ser Gly Ser Gin 
1345 1350 1355 1360 

Arg Pro Ser Val Ser Asp Asp Thr Gly Cys Gly Trp Lys Phe Glu Pro 

1365 1370 1375 

Leu Leu Gly Glu Glu Leu Asp Leu Arg Arg Val Thr Trp Arg Leu Pro 

1380 1385 1390 

Pro Glu Leu He Pro Arg Leu Ser Ala Ser Ser Gly Arg Ser Ser Aso 

1395 1400 1405 

Ala Glu Ala Pro His Gly Pro Pro Asp Asp Gly Gly Ala Gly Gly Lys 

1410 1415 * ~ 1420 

Gly Gly Ser Leu Pro Arg Ser Ala Thr Pro Gly Pro Pro Gly Glu His 
1425 1430 1435 1440 

Leu Val Asn Gly Arg Met Asp Phe Ala Phe Pro Gly Ser Thr Asn Ser 
1445 1450 1455 
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Leu His Arg Met Thr Thr Thr Ser Ala Ala Ala Tyr Gly Thr His Leu 

1460 1465 ^ 1470 

Ser Pro His Val Pro His Arg Val Leu Ser Thr Ser Ser Thr Leu Thr 

1475 1480 1485 

Arg Asp Tyr Asn Ser Leu Thr Arg Ser Glu His Ser His Ser Thr Thr 

1490 1495 1500 

Leu Pro Arg Asp Tyr Ser Thr Leu Thr Ser Val Ser Ser His Gly Leu 
1505 1510 1515 1520 

Pro Pro lie Trp Glu His Gly Arg Ser Arg Leu Pro Leu Ser Trp Ala 

1525 1530 1535 

Leu Gly Ser Arg Ser Arg Ala Gin Met Lys Gly Phe Pro Pro Ser Arg 

1540 1545 1550 

Gly Pro Arg Asp Ser lie lie Leu Ala Gly Arg Pro Ala Ala Pro Ser 

1555 1560 1565 

Trp Gly Pro Asp Ser Arg Leu Thr Ala Gly Val Pro Asp Thr Pro Thr 

1570 1575 1580 

Arg Leu Val Phe Ser Ala Leu Gly Pro Thr Ser Leu Arg Val Ser Trp 
1585 1590 1595 1600 

Gin Glu Pro Arg Cys Glu Arg Pro Leu Gin Gly Tyr Ser Val Glu Tyr 

1605 1610 1615 

Gin Leu Leu Asn Gly Gly Glu Leu His Arg Leu Asn lie Pro Asn Pro 

1620 1625 1630 

Ala Gin Thr Ser Val Val Val Glu Asp Leu Leu Pro Asn His Ser Tyr 

1635 1640 1645 

Val Phe Arg Val Arg Ala Gin Ser Gin Glu Gly Trp Gly Arg Glu Arg 

1650 1655 1660 

Glu Gly Val He Thr He Glu Ser Gin Val His Pro Gin Ser Pro Leu 
1665 1670 1675 1680 

Cys Pro Leu Pro Gly Ser Ala Phe Thr Leu Ser Thr Pro Ser Ala Pro 

1685 1690 1695 

Gly Pro Leu Val Phe Thr Ala Leu Ser Pro Asp Ser Leu Gin Leu Ser 

1700 1705 * 1710 

Trp Glu Arg Pro Arg Arg Pro Asn Gly Asp He Val Gly Tyr Leu Val 

1715 1720 1725 

Thr Cys Glu Met Ala Gin Gly Gly Gly Pro Ala Thr Ala Phe Arg Val 

1730 1735 1740 

Asp Gly Asp Ser Pro Glu Ser Arg Leu Thr Val Pro Gly Leu Ser Glu 
1745 1750 1755 1760 

Asn Val Pro Tyr Lys Phe Lys Val Gin Ala Arg Thr Thr Glu Gly Phe 

1765 1770 1775 

Gly Pro Glu Arg Glu Gly He He Thr He Glu Ser Gin Asp Gly Gly 

1780 1785 1790 

Pro Phe Pro Gin Leu Gly Ser Arg Ala Gly Leu Phe Gin His Pro Leu 

1795 1800 1805 

Gin Ser Glu Tyr Ser Ser He Ser Thr Thr His Thr Ser Ala Thr Glu 

1810 1815 1820 

Pro Phe Leu Val Gly Pro Thr Leu Gly Ala Gin His Leu Glu Ala Gly 
1825 1830 1835 1840 

Gly Ser Leu Thr Arg His Val Thr Gin Glu Phe Val Ser Arg Thr Leu 

1845 1850 1855 

Thr Thr Ser Gly Thr Leu Ser Thr His Met Asp Gin Gin Phe Phe Gin 
1860 1865 1870 

Thr 



<210> 7 
<211> 5907 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note - 
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synthetic construct 
<400> 7 

ggaccgtcga ggcagcggga ctgacccagc tgggctcact gtabtaagaa gcggacccgc 60 

gacccggagc gcccggggac ccgatctggg agcctggacg ggdgcagcgc gcaggaatgc 120 

agtccgcctg actcaccagc gcctccttcc hacctgcgcc gcccgtccat aaagcgctgc 180 

tcgtcccgcc cgccgccgcc gccctgctgt cccgccgggc tcgcccgcgc gctcagctcg 240 

acccaacgca gcccaagtcc gaggtagtct cactaaggag gaggaggatg gcagggccct 300 

gttgcagccc atgggtgaag ctgctgctgc tggcacgaat gctgagtgcc agcctccctg 360 

gagacctggc caaccgctgc aagaaggcfcc aggtgaagag ctgtaccgag tgcatccggg 420 

tggacaagag cfcgtgcctac tgcacagacg agctgttcaa ggagaggcgc tgcaacaccc 430 

aggcggacgt tctggctgca ggctgcaggg gagagagcat cctggtcatg gagagcagcc 540 

ttgaaatcac agagaacacc cagatcgtca ccagcctgca ccgcagccag gtatctcccc 600 

aaggcctgca agtccggctg cggcggggtg aggagcgcac gtttgtgttc caggtctttg 660 

agcccctgga gagccccgtg gatctgtata tcctcatgga cttctccaac tccatgtctg 720 

acgatctgga caacctcaag cagatggggc agaacctggc caagatcctg cgccagctca 780 

ccagcgacta caccattgga tttggaaagt ttgtggacaa agtcagcgtc ccacagacag 840 

acatgaggcc cgagaaactg aaggagcccfc ggcccaacag tgatcccccg ttctccttca 900 

agaacgttat cagcttaacg gagaatgtgg aagaattctg gaacaaactg caaggagaac 960 

gcatctcagg caacctggac gctcctgaag ggggctttga tgccatcctg cagacagctg 1020 

tgtgcacaag ggacatfcggc tggagggctg acagcaccca cctgctggtg ttctccaccg 1080 

agtctgccht ccactacgag gctgatggtg ccaacgttct ggccggcatc atgaaccgca 1140 

atgatgagaa atgccacctg gacgcctcgg gcgcctacac ccaatacaag acacaggact 1200 

acccatcagt gcccacgctg gttcgcctgc ttgccaagca taacatcatc cccatctttg 1260 

ctgtcaccaa ctactcttac agctactatg agaagctcca taagtatttc cccgtctcct 1320 

ctctgggcgt cctgcaggag gattcatcca acatcgtgga gctgctggag gaggccttct 1380 

atcgaattcg ctccaacctg gacatccggg ctctggacag ccccagaggc ctgagaacag 1440 

aggtcacctc cgatactctc cagaagacgg agactgggtc ctttcacatc aagcgggggg 1500 

aagtgggcac atacaatgtg catctccggg cagtggagga catagatggg acacatgtgt 1560 

gccagctggc taaagaagac caagggggca acatccacct gaaaccctcc ttctctgatg 1620 

gcctccggat ggacgcgagt gtgatctgtg acgtgtgccc ctgtgagctg caaaaggaag 1680 

ttcgatcagc tcgctgtcac ttcagaggag acttcatgtg tggacactgt gtgtgcaatg 1740 

agggctggag tggcaaaacc tgcaactgct ccaccggctc tctgagtgac acacagccct 1800 

gcctgcgtga gggtgaggac aaaccgtgct cgggccacgg cgagtgccag tgcggacgct 1860 

gtgtgtgcta tggtgaaggc cgctacgagg gtcacttctg cgagtatgac aacttccagt 1920 

gtccccggac ctctggattc c-tgtgcaatg accggggacg ctgttctatg ggagagtgtg 1930 

tgtgtgagcc tggttggaca ggccgcagct gcgactgtcc cctcagcaat gccacctgca 2040 

tcgatagcaa cgggggcatc tgcaacggcc gaggctactg tgagtgtggc cgttgtcact 2100 

gcaaccagca gtcgctctac acggacacca cctgtgagat caactactct gcgatactgg 2160 

gtctctgtga ggatctccgc tcctgcgtac agtgccaggc ctggggcacc ggggagaaga 2220 

aagggcgcgc gtgtgacgat tgccccttta aagtcaagat ggtagacgag cttaagaaag 2230 

aagaggtggt ggagtactgc tccttccggg atgaggatga cgactgcact tacagctaca 2340 

acgtggaggg cgacggcagc cctgggccca acagcacagt cctggtccac aaaaagaaag 2400 

actgcctccc ggctcctfccc tggtggctca tccccctgct catcttcctc ctgttgctcc 24 60 

tggcgttgct tctgctgctc tgctggaaat actgtgcctg ctgcaaagcc tgcctggggc 2520 

ttcttccttg ctgcaaccga ggtcacatgg tgggctttaa ggaagatcac tatatgcttc 2580 

gggagaacct gatggcctct gaccacctgg acacgcccat gctacgaagc gggaacctca 2640 

agggacgaga cacagtccgc tggaagatca ccaacaatgt gcagcgccct ggctttgcca 2700 

cccatgccgc cagcaccagc cccacggagc tcgtacccta cgggctgtcc ctgcgccttg 27 60 

gccgcctctg cactgagaac cttatgaagc cgggcacccg agagtgtgac cagctacgcc 2820 

aggaggtgga ggaaaatctg aatgaggtgt atagacaggt cagcggcgca cacaagctcc 2880 

agcagacgaa gttccgacag cagcccaacg ccgggaaaaa gcaagaccac accattgtgg 2940 

acacagtgtt gctggcgccc cgctccgcca agcagatgct gctgaagctg acagagaagc 3000 

aggtggagca ggggtcctfcc catgaactga aggtggcccc tggctactac actgtcacgg 3060 

cagagcagga tgcccggggc atggtggagt tccaggaggg cgtggagctg gtggafcgtgc 3120 

gagtgcccct cttcatccgg cctgaggatg atgatgagaa gcagctgctg gtggaggcca 3190 

ttgatgtccc tgtgagcact gccacccttg gtcgccgtct ggtaaacatc accattatca 3240 

aggaacaagc tagtggggta gtgtccttcg agcagcctga atactcggtg agtcgtggag 3300 

accaggtggc ccgcatccct gtcatccggc acatcctgga caatggcaag tcccaggtct 3360 

cctatagcac acaggataat acagcacacg gacaccggga ttatgttccc gtggagggag 3420 

agctgctgtt ccatcctggg gaaacctgga aggagttgca ggtgaagcta ctggagcfcgc 3430 

aggaggttga ctccctcctg cgtggccgcc aggtccgccg cttccaagtc caactcagca 3540 
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accccaagtt cggagcccgc ctgggccagc ccagcacaac caccgttatt ctcgatgaaa 3600 

cggacaggag bctcataaab caaacacttt cabcgccbcc gccaccccab ggagaccbgg 3660 

gcgccccaca gaaccccaat gccaaggcbg ccggatccag gaagatccab tbcaactggc 3720 

tgccccctcc tggcaagcca atggggtaca gggtgaagta ctggatccag ggcgactctg 3780 

aatctgaagc ccacctbcfca gatagcaagg bgcccbcagt ggaacbcacc aaccbgtabc 3840 

ccbattgcga ctacgaaatg aaggtgbgbg cctatggggc caagggtgag gggccctata 3900 

gcbcactggt gtcctgccgc acccaccagg aagtacccag hgagccaggg aggcbggcbt 3960 

tcaatgtagt ctcbtctacg gbgactcagc tgagcbgggc agagccagct gagaccaatg 4020 

gcgagatcac agcctacgag gbctgcbabg gactggtcaa tgaggacaac agacccabbg 4080 

gacctatgaa gaaggtgcbc gtggacaacc ccaagaaccg gatgctgctc abtgagaabc 4140 

bgcgagattc ccagccatac cgatacacgg ttaaggcgcg caabggggca ggatggggac 4200 

ccgagagaga ggcbatcabc aacctggcta cacagcccaa gcggcccatg tccatcccba 4260 

bcatcccaga catccccaba gtggacgccc agggtggaga agactacgaa aacbbccbta 4320 

tgtacagtga tgacgtcctg cggtccccag ccagcagcca gaggcccagc gbtbctgabg 4380 

acactgagca cctggtgaat ggccggatgg actttgccta tccaggcagc gccaactccc 4440 

tgcacagaat gactgcagcc aatgtggcct abggcacgca tctgagccca caccbgtccc 4 500 

accgagtgct gagcacgtcc tccaccctta cccgggacta ccacbctctg acacgcacag 4 5 60 

agcactccca ctcaggcaca cttcccaggg actactccac cctcacttcc ctttccbccc 4620 

aagcctccct cctatctggg aagatgggag gagcaggctt ccgctgtcct ggactcttgg 4 680 

gtccttgagc cgggctcaca tgaagggtgt gcccgcatcc aggggttcac cagactcbat 4740 

aabcctggcc gggcagtcag cagcaccctc ctggggtaca ggattcccgb ggggctgtgg 4800 

gtgtgcctga cacacccact cggctggtgt tcbctgccct ggggcgcacg tctfctgaagg 4860 

bgagctggca ggagccacag tgtgatcgga cgctgctggg cbacagtgtg gaataccagc 4 920 

tacbaacgtg cgtcgagatg catcggctca acatccctaa ccctggccaa acctcggtgg 4 980 

tggtagagga tctcctgcct aactactctc atgtgttccg ggtacgggca cagagccagg 5040 

agggctgggg tcgagagcga gagggtgtca tcaccabcga gtcccaggtg cacccgcaga 5100 

gccctctctg ccccctgcca ggctcagcct tcactctgag cacccccagc gccccaggac 5160 

cactggtgtt cacbgcccta agcccagact ccccgcagcb cagctgggag cggccgagga 5220 

gccgcaatgg agatatcctb ggctacctgg tgacctgtga gatggcccaa ggaggagcac 5280 

cagccaggac cttccgggtg gacggagaca accctgagag ccggbtgact gtacctggcc 5340 

tcagtgagaa cgttccttac aagttcaagg ttcaggccag gacgaccgag ggctttgggc 5400 

cagagcgbga gggtatcatc accatcgagt ctcaggbfcgg aggccccttc ccacagcbgg 54 60 

gcagcaattc tgggctcttc cagaacccag tgcaaagcga gttcagcagc gtgaccagca 5520 

cgcacagcac cacgactgag cccttccbca tggatggtcb aaccctgggg acccagcgcc 5580 

bggaagcagg aggctccctc acccggcatg tgacccagga abtcgtgacc cggaccttaa 5640 

cggccagtgg ctctctcagc actcatatgg accaacagtt cttccaaacc bgaacctccc 5700 

ccgcgcccca gccacctggg cccctccttg cctcctcbcc tagcgccttc ttcctctgct 5760 

gctctaccca cgagctbgct gaccacagag ccagcccctg bagtcagaga gcaggggtag 5820 

gtgctgtcca ggaaccataa agtgggtaga ggtgatacaa ggtctttctg actgcatccc 5880 

accctgggtc caatcccaca tgtaacc * 5907 



<210> 8 
<211> 1466 
<212> PRT 

<213> Arbificial Sequence 

<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 8 

Met Ala Gly Pro Cys Cys Ser Pro 

1 5 
Arg Meb Leu Ser Ala Ser Leu Pro 
20 

Lys Ala Gin Val Lys Ser Cys Thr 

35 40 
Cys Ala Tyr Cys Thr Asp Glu Leu 
50 55 



Trp Val Lys Leu Leu Leu Leu Ala 

10 15 
Gly Asp Leu Ala Asn Arg Cys Lys 
25 30 
Glu Cys lie Arg Val Asp Lys Ser 
45 

Phe Lys Glu Arg Arg Cys Asn Thr 
60 
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win nla 


Asp 


vai 








Met bJLU 


Ser 


oel 


1»2U His 


Arg 


ser 






100 


Arg Gly 


Glu 


Glu 




115 




Ser Pro 


Val 


Asp 






Asp Asp 


Leu 


Asp 








Leu Arg 


Gin 


Leu 


Asp Lys 


Val 


Ser 






180 


Glu Pro 


Trp 


Pro 




195 




Ser Leu 


Thr 


Glu 


210 






Arg lie 


Ser 


Gly 






Leu Gin 


Thr 


Ala 


TIL „ TJ ' _ 

Thr His 


Leu 


Leu 






260 


Asp Gly 


Ala 


Asn 




C t D 




Cys His 


Leu 


Asp 


290 






Tyr Pro 


Ser 


Val 


o r\ c 






lie Pro 


lie 


Phe 


Leu His 


Lys 


Tyr 






340 


Ser Ser 


Asn 


He 




35o 




Ser Asn 


Leu 


Asp 


o "j r\ 

370 






Glu Val 


Thr 


Ser 


385 






lie Lys 


Arg 


Gly 


Glu Asp 


lie 


Asp 






420 


Gly Gly 


Asn 


He. 




ATE 

435 




Asp Ala 


Ser 


Val 


4 50 






Val Arg 


Ser 


TV 1 ~ 

Ala 


4 b*3 






Cys Val 


Cys 


Asn 


Gly Ser 


Leu 


Ser 






500 


Pro Cys 


Ser 


Gly 




515 




Gly Glu 


Gly 


Arg 


530 






Cys Pro 


Arg 


Thr 


545 







Leu 


Ala 


Ala 


Gly 




-7 n 

/ 0 






Leu 


Glu 


lie 


Thr 


85 








Gin 


Val 


Ser 


Pro 


Arg 


Thr 


Phe 


Val 








120 


Leu 


Tyr 


He 


Leu 






135 




Asn 


Leu 


Lys 


Gin 




150 






Thr 


Ser 


Asp 


Tyr 


165 








Val 


Pro 


Gin 


Thr 


Asn 


Ser 


Asp 


Pro 








200 


Asn 


Val 


Glu 


Glu 






215 




Asn 


Leu 


Asp 


Ala 




230 






Val 


Cys 


Thr 


Arg 


245 








Val 


Phe 


Ser 


Thr 


Val 


Leu 


Ala 


Gly 








280 


Ala 


Ser 


Gly 


Ala 






295 




Pro 


Thr 


Leu 


Val 




310 






Ala 


Val 


Thr 


Asn 


325 








Phe 


Pro 


.Val 


Ser 


Val 


Glu 


Leu 


Leu 








360 


He 


Arg 


Ala 


Leu 






375 




Asp 


Thr 


Leu 


Gin 




390 






Glu 


Val 


Gly 


Thr 


405 








Gly 


Thr 


His 


Val 


His 


Leu 


Lys 


Pro 








440 


He 


Cys 


Asp 


Val 






455 




Arg 


Cys 


His 


Phe 




470 






Glu 


Gly 


Trp 


Ser 


'too 








Asp 


Thr 


Gin 


Pro 


His 


Gly 


Glu 


Cys 








520 


Tyr 


Glu 


Gly 


His 






535 




Ser 


Gly 


Phe 


Leu 



16 



Cys 


Arg 


Gly 


Glu 






7 5 




Glu 


Asn 


Thr 


Gin 




90 






Gin 


Gly 


Leu 


Gin 


105 








Phe 


Gin 


Val 


Phe 


Met 


Asp 


Phe 


Ser 








140 


Met 


Gly 


Gin 


Asn 






155 




Thr 


He 


Gly 


Phe 




170 






Asp 


Met 


Arg 


Pro 


185 








Pro 


Phe 


Ser 


Phe 


Phe 


Trp 


Asn 


Lys 








220 


Pro 


Glu 


Gly 


Gly 






235 




Asp 


He 


Gly 


Trp 




250 






Glu 


Ser 


Ala 


Phe 


265 








He 


Met 


Asn 


Arg 


Tyr 


Thr 


Gin 


Tyr 








300 


Arg 


Leu 


Leu 


Ala 






315 




Tyr 


Ser 


Tyr 


Ser 




330 






Ser 


Leu 


Gly 


Val 


345 








Glu 


Glu 


Ala 


Phe 


Asp 


Ser 


Pro 


Arg 








380 


Lys 


Thr 


Glu 


Thr 






395 




Tyr 


Asn 


Val 


His 




410 






Cys 


Gin 


Leu 


Ala 


425 








Ser 


Phe 


Ser 


Asp 


Cys 


Pro 


Cys 


Glu 








4 60 


Arg 


Gly 


Asp 


Phe 






475 




Gly 


Lys 


Thr 


Cys 




a q n 






Cys 


Leu 


Arg 


Glu 


505 








Gin 


Cys 


Gly 


Arg 


Phe 


Cys 


Glu 


Tyr 








540 


Cys 


Asn 


Asp 


Arg 



555 



oer 


He 


Leu 


Val 








80 


He 


Val 


Thr 


Ser 






95 




Val 


Arg 


Leu 


Arg 




110 






Glu 


Pro 


Leu 


Glu 


125 








Asn 


Ser 


Met 


Ser 


Leu 


Ala 


Lys 


He 








160 


Gly 


Lys 


Phe 


Val 






175 




Glu 


Lys 


Leu 


Lys 




190 






Lys 


Asn 


Val 


He 


205 








Leu 


Gin 


Gly 


Glu 


Phe 


Asp 


Ala 


He 








240 


Arg 


Ala 


Asp 


Ser 






255 




His 


Tyr 


Glu 


Ala 




270 






Asn 


Asp 


Glu 


Lys 


285 








Lys 


Thr 


Gin 


Asp 


Lys 


His 


Asn 


lie 








320 


Tyr 


Tyr 


Glu 


Lys 






335 




Leu 


Gin 


Glu 


Asp 




350 




Tyr 


Arg 


He 


Arg 


365 








Gly 


Leu 


Arg 


Thr 


Gly 


Ser 


Phe 


His 








400 


Leu 


Arg 


Ala 


Val 






415 




Lys 


Glu 


Asp 


Gin 




430 






Gly 


Leu 


Arg 


Met 


445 








Leu 


Gin 


Lys 


Glu 


Met 


Cys 


Gly 


His 








480 


Asn 


Cys 


Ser 


Thr 






A Q C 

4 y o 




Gly 


Glu 


Asp 


Lys 




510 






Cys 


Val 


Cys 


Tyr 


525 








Asp 


Asn 


Phe 


Gin 


Gly 


Arg 


Cys 


Ser 



560 
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Met Gly 


Glu 


y Z> 


vol 




\J ± u 


r lu 


o jl y 










J U J 








Cys 


Pro 


T>p i ) 

iJCU 


O C JL 


ZA c r> 


rtl a 


jl nr 




Tl o 
J. JL C 








580 








585 


Asn 


Gly 


rti. y 


CI v 
w_l y 


iyr 


Cys 




Cys 


biy 






595 










O vv 




Ser 


Leu 


T vr 

i yj. 


Thr 




i nr 


i nr 


Cys 






610 










D JL J 




Gly Leu 




ft! ii 

*J J. U 


rtsp 


Leu 


Arg 


oer 


Cys 


625 
















Thr Gly 


p 1 it 

bi. U 


Lys 


Lys 


biy 


Arg 


Ala 


Cys 










Ol J 








Lys 


Met 


Vdl 


Asp 


blU 


Leu 


Lys 


Lys 


Glu 








fi^n 
o ou 










c c c 
bob 


Phe 


Arg 


Asp 


pi ii 


Asp 


Asp 


Asp 


Cys 


rp Vi -v.. 

Inr 






675 










6ft n 

DO U 




Asp 


Gly 


oer 


D r""<~\ 
rw 


CI it- 


rro 


Asn 


C Q v- 

oer 


inr 




690 










O? J 






Asp Cys 


Ti All 


rru 


Ala 


Pro 

rro 


Cor 

oer 


Trp 


Trp 


7 05 










' 1U 




Leu 


Leu 


Leu 


Leu 


Leu 


Aia 


Leu 


Leu 


T OM 

Leu 










7?q 










Ala Cys 


Cys 


Lys 


Aia 


Cys 


Leu 


p 1 t ■ 

Gly 


Leu 


















T AC 


His 


Met 


vai 


pi \» 
biy 


rne 


Lys 


GlU 


Asp 


His 
















7 £n 

/ OU 




Met 


Ala 


C d -y 

oer 


Asp 


rilS 


Leu 


Asp 


Tnr 


Pro 




770 










775 






Lys 


Gly 


Arg 


Asp 


rp U _ 

i nr 


vai 


Arg 


Trp 


Lys 


785 










7 an 








Pro 


Gly 


rne 


Til a 

ax a 


inr 


HiS 


Aia 


Ala 


Cam 

Ser 










fin 1 ; 










Pro 


Tyr 


biy 


Leu 


Q a _« 

oer 


Leu 


Arg 


Leu 


Gly 








ft 9 n 










Q 7 C 


Met 


Lys 


Pro 


p*i ^ 

bxy 


inr 


Arg 


GlU 

- 


Cys 


Asp 






0 Jj 








q 4 a 
o 4 0 




Glu 


Asn 


Leu 


Asn 


blU 


val 


Tyr 


Arg 


Gin 




850 










QCC 

OJJ 




Gin 


Gin 


inr 


Lys 


Pne 


Arg 


Gin 


Gin 


Pro 


865 










O "7 A 








His 


Thr 


lie 


val 


Asp 


Tnr 


val 


Leu 


Leu 










0 0 D 










Met 


Leu 


Leu 


Lys 


Leu 


rnr 


p 1 . « 
GlU 


Lys 


Gin 


















q e\ c 


Glu 


Leu 


Lys 


vai 


Ala 


Pro 


bxy 


Tyr 


Tyr 






:J13 












Ala 


Arg 


p"i .» 


we r, 


v ax 


bXU 


rne 


bin 


GlU 




930 
















Arg 


Val 




Leu 


Pha 

trne 


lie 


Arg 


Pro 


Glu 


945 










you 






Leu 


Val 


Glu 


Aia 


lie 


Asp 


vai 


Pro 


Val 










yoo 










Arg 


Leu 


Val 


Asn 


Ti- 
ne 


Thr 


lie 


lie 


Lys 








980 










985 


Ser 


Phe 


Glu 


Gin 


Pro 


Glu 


Tyr 


Ser 


Val 






995 










1000 


Arg 


He 


Pro 


Val 


He 


Arg 


His 


lie 


Leu 




1010 








101: 


> 




Ser 


Tyr 


Ser 


Thr 


Gin 


Asp 


Asn 


Thr 


Ala 


1025 








1030 






Fro 


Val 


Glu 


Gly 


Glu 


Leu 


Leu 


Phe 


His 








1045 
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irp 


i nr 


biy 


ni y 


oer 


Cys 


7\ _ _ 

ASp 


570 










R7 s 




Mop 


oer 


As n 




oiy 


lie 


^ys 
















Arg 


Lys 


HIS 


bys 


Asn 


bin 


pi 
Gin 








fins 








X JL " 


7A o r\ 


iyr 


oer 


Aid 


ixe 


Leu 
















va i 


bin 


Cys 


bin 


Aia 


Trp 


P T . . 

Gly 




6 ^ ^ 










C A A 

o4 U 


Asp 


Asp 


Cys 


Pro 


Pne 


Lys 


Val 


£ 5 n 
oDU 










c c c 

555 




blU 


vai 


vai 


blU 


Tyr 


Cys 


Ser 










^7 n 
o / u 






Tyr 


Ser 


Tyr 


Asn 


val 


Glu 


Gly 








f^ft ^ 

O 0 o 






v a x 


Leu 


vai 


nlS 


Lys 


T ... 

Lys 


Lys 






7 nr% 










Leu 


lie 


Pro 


Leu 


Leu 


lie 


Phe 




/ ID 










7 7 A 


Leu 


lieu 


Cys 


Trp 


Lys 


Tyr 


Cys 


7 7 n. 
/ JU 










H E 

735 




Leu 


Ova 

rro 


Cys 


Cys 


Asn 


Arg 


Gly 










i c n 

750 






Tyr 


Met 


Leu 


Arg 


Glu 


Asn 


Leu 








7 C C 

/ 00 








Met 


Leu 


Arg 


Ser 


Gly 


Asn 


Leu 






7 Q Ci 
1 0 U 










lie 


Tnr 


Asn 


TV _ _ 

Asn 


Val 


p i 
Gin 


Arg 














Q A A 


inr 


Ser 


Pro 


Tnr 


pi ,, 

GlU 


Leu 


Val 


ft t n 

OIL/ 










QIC 

0 10 




Arg 


Leu 


Cys 


rpu- 

i nr 


GlU 


Asn 


Leu 










Q 7 A 
O JO 






bin 


T At* 

Leu 


Arg 


pi _ 
Gin 


Glu 


Val 


Glu 








845 








val 


Ser 


Gly 


Ala 


His 


Lys 


Leu 






8 60 










Asn 


Ala 


Gly 


Lys 


Lys 


Gin 


Asp 




o7 b 










880 


TV ^ _ 

Ala 


Pro 


Arg 


Ser 


Ala 


Lys 


Gin 


QQA 










O A C 

o 95 




vai 


bXU 


Gin 


pi 
Gly 


Ser 


Phe 


His 










yiu 






Tnr 


Val 


Thr 


Ala 


P T . . 

Glu 


Gin 


Asp 








oo c 






Gly 


val 


GlU 


Leu 


Val 


Asp 


Val 






o a n 

y 3 u 










ASp 


Asp 


Asp 


Glu 


Lys 


Gin 


Leu 




QCC 

? j j 










A A 

960 


oer 


Tnr 


Ala 


Thr 


Leu 


Gly 


Arg 


Q7 n 










y / o 




GlU 


Gin 


Ala 


Ser 


Gly 


Val 


Val 










990 






Ser 


Arg 


Gly 


Asp 


Gin 


Val 


Ala 








1005 






Asp 


Asn 


Gly 


Lys 


Ser 


Gin 


Val 






1020 








His 


Gly 


His 


Arg 


Asp 


Tyr 


Val 




1035 








1040 


Pro 


Gly Glu Thr Trp 


Lys 


Glu 



1050 1035 
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Leu Gin Val Lys Leu Leu Glu Leu Gin Glu Val Asp Ser Leu Leu Arg 

1060 1065 * 1070 

Gly Arg Gin Val Arg Arg Phe Gin Val Gin Leu Ser Asn Pro Lys Phe 

1075 1080 1085 

Gly Ala Arg Leu Gly Gin Pro Ser Thr Thr Thr Val He Leu Asp Glu 

1090 1095 HOO 

Thr Asp Arg Ser Leu He Asn Gin Thr Leu Ser Ser Pro Pro Pro Pro 
1105 1110 His 1120 

His Gly Asp Leu Gly Ala Pro Gin Asn Pro Asn Ala Lys Ala Ala Gly 

1125 H30 ~ 1135 

Ser Arg Lys He His Phe Asn Trp Leu Pro Pro Pro Gly Lys Pro Met 

1140 1145 1150 

Gly Tyr Arg Val Lys Tyr Trp He Gin Gly Asp Ser Glu Ser Glu Ala 

1155 1160 1165 

His Leu Leu Asp Ser Lys Val Pro Ser Val Glu Leu Thr Asn Leu Tyr 

1170 1175 1180 

Pro Tyr Cys Asp Tyr Glu Met Lys Val Cys Ala Tyr Gly Ala Lys Gly 
1185 1190 1195 1200 

Glu Gly Pro Tyr Ser Ser Leu Val Ser Cys Arg Thr His Gin Glu Val 

1205 1210 1215 

Pro Ser Glu Pro Gly Arg Leu Ala Phe Asn Val Val Ser Ser Thr Val 

1220 1225 1230 

Thr Gin Leu Ser Trp Ala Glu Pro Ala Glu Thr Asn Gly Glu He Thr 

1235 1240 1245 

Ala Tyr Glu Val Cys Tyr Gly Leu Val Asn Glu Asp Asn Arg Pro He 

1250 1255 1260 

Gly Pro Met Lys Lys Val Leu Val Asp Asn Pro Lys Asn Arg Met Leu 
1265 1270 1275 1280 

Leu He Glu Asn Leu Arg Asp Ser Gin Pro Tyr Arg Tyr Thr Val Lys 

1285 1290 "* 1295 

Ala Arg Asn Gly Ala Gly Trp Gly Pro Glu Arg Glu Ala He He Asn 

1300 1305 1310 

Leu Ala Thr Gin Pro Lys Arg Pro Met Ser He Pro He He Pro Asp 

1315 1320 1325 

He Pro He Val Asp Ala Gin Gly Gly Glu Asp Tyr Glu Asn Phe Leu 

1330 1335 1340 

Met Tyr Ser Asp Asp Val Leu Arg Ser Pro Ala Ser Ser Gin Arg Pro 
1345 1350 1355 1360 

Ser Val Ser Asp Asp Thr Glu His Leu Val Asn Gly Arg Met Asp Phe 

1365 1370 1375 

Ala Tyr Pro Gly Ser Ala Asn Ser Leu His Arg Met Thr Ala Ala Asn 

1380 1385 1390 

Val Ala Tyr Gly Thr His Leu Ser Pro His Leu Ser His Arg Val Leu 

1395 1400 1405 

Ser Thr Ser Ser Thr Leu Thr Arg Asp Tyr His Ser Leu Thr Arg Thr 

1410 1415 " 1420 

Glu His Ser His Ser Gly Thr Leu Pro Arg Asp Tyr Ser Thr Leu Thr 
1425 1430 1435 1440 

Ser Leu Ser Ser Gin Ala Ser Leu Leu Ser Gly Lys Met Gly Gly Ala 

1445 1450 1455 

Gly Phe Arg Cys Pro Gly Leu Leu Gly Pro 
1460 1465 



<210> 9 
<211> 2205 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secuence; Note = 
synthetic construct 
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<400> 9 

atggcagggc cctgttgcag cccatgggtg aagctgctgc tgctggcacg aatgctgagt 60 

gccagcctcc ctggagacct ggccaaccgc tgcaagaagg ctcaggtgaa gagctgtacc 120 

gagtgcatcc gggtggacaa gagctgtgcc tactgcacag acgagctgtt caaggagagg 180 

cgctgcaaca cccaggcgga cgttctggct gcaggctgca ggggagagag catcctggtc 240 

atggagagca gccttgaaat cacagagaac acccagatcg tcaccagcct gcaccgcagc 300 

caggtatctc cccaaggcct gcaagtccgg ctgcggcggg gtgaggagcg cacgtttgtg 360 

fctccaggtct ttgagcccct ggagagcccc gtggatctgt atatcctcat ggacttctcc 420 

aactccatgt ctgacgatct ggacaacctc aagcagatgg ggcagaacct ggccaagatc 480 

ctgcgccagc tcaccagcga ctacaccatt ggatttggaa agtttgtgga caaagtcagc 540 

gtcccacaga cagacatgag gcccgagaaa ctgaaggagc cctggcccaa cagtgatccc 600 

ccgttctcct tcaagaacgt tatcagctta acggagaatg tggaagaatfc ctggaacaaa 660 

ctgcaaggag aacgcatctc aggcaacctg gacgctcctg aagggggctt tgatgccafcc 720 

ctgcagacag ctgtgtgcac aagggacatt ggctggaggg ctgacagcac ccacctgctg 780 

gtgttctcca ccgagtctgc cttccactac gaggctgatg gtgccaacgt tctggccggc 340 

atcatgaacc gcaatgatga gaaatgccac ctggacgcct cgggcgccta cacccaatac 900 

aagacacagg actacccatc agtgcccacg ctggttcgcc tgcttgccaa gcataacafcc 960 

atccccatct ttgctgtcac caactactct tacagctact atgagaagct ccataagtat 1020 

ttccccgtct cctctctggg cgtcctgcag gaggattcat ccaacatcgt ggagctgctg 1080 

gaggaggcct tctatcgaat tcgctccaac ctggacatcc gggctctgga cagccccaga 1140 

ggcctgagaa cagaggtcac ctccgatact ctccagaaga cggagactgg gtcctttcac 1200 

atcaagcggg gggaagtggg cacatacaat gtgcatctcc gggcagtgga ggacatagat 1260 

gggacacatg tgtgccagct ggctaaagaa gaccaagggg gcaacatcca cctgaaaccc 1320 

tccttctctg atggcctccg gatggacgcg agtgtgatct gtgacgtgtg cccctgtgag 1380 

ctgcaaaagg aagttcgatc agctcgctgt cacttcagag gagacttcat gtgtggacac 1440 

tgtgtgtgca atgagggctg gagtggcaaa acctgcaact gctccaccgg ctctctgagt 1500 

gacacacagc cctgcctgcg tgagggtgag gacaaaccgt gctcgggcca cggcgagtgc 1560 

cagtgcggac gctgtgtgtg ctatggtgaa ggccgctacg agggtcactt ctgcgagtat 1620 

gacaacttcc agtgtccccg gacctctgga ttcctgtgca atgaccgggg acgctgttct 1680 

atgggagagt gtgtgtgtga gcctggttgg acaggccgca gctgcgactg tcccctcagc 1740 

aatgccacct gcatcgatag caacgggggc atctgcaacg gccgaggcta ctgtgagtgt 1800 

ggccgttgtc actgcaacca gcagtcgctc tacacggaca ccacctgtga gatcaactac 18 60 

tctgcgatac tgggtctctg tgaggatctc cgctcctgcg tacagtgcca ggcctggggc 1920 

accggggaga agaaagggcg cgcgtgtgac gattgcccct ttaaagtcaa gatggtagac 1980 

gagcttaaga aagaagaggt ggtggagtac tgctccttcc gggatgagga tgacgactgc 204 0 

acttacagct acaacgtgga gggcgacggc agccctgggc ccaacagcac agtcctggtc 2100 

cacaaaaaga aagactgcct cccggctcct tcctggtggc tcatccccct gctcatcttc 2160 

ctcctgttgc tcctggcgtt gcttctgctg ctctgctgga aatga 2205 

<210> 10 
<211> 734 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note - 
synthetic construct 



<400> 10 



Met 


Ala 


Gly 


Pro 


Cys 


Cys 


Ser 


. Pro 


Trp Val 


Lys 


Leu 


Leu Leu Leu 


Ala 


1 








5 








10 






15 




Arg 


Met 


Leu 


Ser 


Ala 


Ser 


Leu 


Pro 


Gly Asp 


Leu 


Ala 


Asn Arg Cys 


Lys 








20 










25 






30 


Lys 


Ala 


Gin 
35 


Val 


Lys 


Ser 


Cys 


Thr 
40 


Glu Cys 


He 


Arg 


Val Asp Lys 
45 


Ser 


Cys 


Ala 


Tyr 


Cys 


Thr 


Asp 


Glu 


Leu 


Phe Lys 


Glu 


Arg 


Arg Cys Asn Thr 




50 










55 








60 






Gin Ala Asp 


Val 


Leu 


Ala 


Ala 


Gly 


Cys Arg 


Gly Glu 


Ser lie Leu 


Val 


65 










70 






75 






80 


Met 


Glu 


Ser 


Ser 


Leu 
85 


Glu 


He 


Thr 


Glu Asn 
90 


Thr 


Gin 


He Val Thr 
95 


Ser 
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Leu 


HIS 


Arg 


5er 


Gin 


Val 


Ser 


Pro 


Gin 








inn 










lUb 


Arg 


Gly 


blU 


pin 

blU 


Arg 


Thr 


Phe 


Val 


Phe 














ion 




C A V- 


Pro 


val 


Asp 


Leu 


Tyr 


He 


Leu 


Met 




1 jU 










135 






ASp 


Asp 


Leu 


Asp 


Asn 


Leu 


Lys 


Gin 


Met 


145 










150 








Leu 


Arg 


Gin 


Leu 


Thr 


Ser 


Asp 


Tyr 


Thr 










165 








Asp 


Lys 


Val 


Ser 


Val 


Pro 


Gin 


Thr 


Asp 








ion 










185 


Glu 


Pro 


Trp 


Pro 


Asn 


Ser 


Asp 


Pro 


Pro 






195 










200 




o 

oer 


Leu 


Thr 


Glu 


Asn 


Val 


Glu 


Glu 


Phe 




^ 1 u 










215 






Arg 


lie 


Ser 


Gly 


Asn 


Leu 


Asp 


Ala 


Pro 












230 






Leu 


Gin 


Thr 


Ala 


Val 


Cys 


Thr 


Arg 


Asp 










245 










mi 

Thr 


His 


Leu 


Leu 


Val 


Phe 


Ser 


Thr 


Glu 








260 










2 65 


Asp 


Gly 


Ala 


Asn 


Val 


Leu 


Ala 


Gly 


He 






275 










280 




Cys 


His 


Leu 


Asp 


Ala 


Ser 


Gly 


Ala 


Tyr 




290 










295 




Tyr 


Pro 


Ser 


Val 


Pro- 


Thr 


Leu 


Val 


Arg 












310 






lie 


Pro 


lie 


Phe 


Ala 


Val 


Thr 


Asn 


Tyr 










325 








Leu 


His 


Lys 


Tyr 


Phe 


Pro 


Val 


Ser 


Ser 








34 0 










345 


Ser 


Ser 


Asn 


lie 


Val 


Glu 


Leu 


Leu 


Glu 






355 










360 




Ser 


Asn 


Leu 


Asp 


He 


Arg 


Ala 


Leu 


Asp 




370 










375 




Glu 


Val 


Thr 


Ser 


Asp 


Thr 


Leu 


Gin 


Lys 


n a c 

385 










390 






lie 


Lys 


Arg 


Gly 


Glu 


Val 


Gly 


Thr 


Tyr 










405 










Glu 


Asp 


He 


Asp 


Gly 


Thr 


His 


Val 


Cys 








420 










425 


Gly 


Gly 


Asn 


lie 


His 


Leu 


Lys 


Pro 


Ser 






435 










440 




Asp 


Ala 


Ser 


Val 


He 


Cys 


Asp 


Val 


Cys 




4 50 










455 




Val 


Arg 


Ser 


Ala 


Arg 


Cys 


His 


Phe 


Arg 


4 bb 










4 70 






Cys 


Val 


Cys 


Asn 


Glu 


Gly 


Trp 


Ser 


Gly 










485 










Gly 


Ser 


Leu 


Ser 


Asp 


Thr 


Gin 


Pro 


Cys 








500 










505 


Pro 


Cys 


Ser 


Gly 


His 


Gly 


Glu 


Cys 


Gin 






515 










520 






Glu 


Gly 


Arg 


Tyr 


Glu 


Gly 


His 


Phe 




530 










535 






Cys 


Pro 


Arg 


Thr 


Ser 


Gly 


Phe 


Leu 


Cys 


545 










550 






Met 


Gly 


Glu 


Cys 


Val 


Cys 


Glu 


Pro 


Gly 










565 








Cys 


Pro 


Leu 


Ser 


Asn 


Ala 


Thr 


Cys 


lie 








580 








585 
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20 



Gly 


Leu 


Gin 


Val 


Arg 


Leu 


Arg 










110 






Gin 


Val 


Phe 


Glu 


Pro 


Leu 


Glu 








125 








Asp 


Phe 


Ser 


Asn 


Ser 


Met 


Ser 






140 










Gly 


Gin 


Asn 


Leu 


Ala 


Lys 


lie 




155 










160 


He 


Gly 


Phe 


Gly 


Lys 


Phe 


Val 


170 










175 




Met 


Arg 


Pro 


Glu 


Lys 


Leu 


Lys 










190 






Phe 


Ser 


Phe 


Lys 


Asn 


Val 


He 




- 




205 








Trp 


Asn 


Lys 


Leu 


Gin 


Gly 


Glu 






220 










Glu 


Gly 


Gly 


Phe 


Asp 


Ala 


He 




235 








240 


He 


Gly 


Trp 


Arg 


Ala 


Asp 


Ser 


250 










255 




Ser 


Ala 


Phe 


His 


Tyr 


Glu 


Ala 










270 






Met 


Asn 


Arg 


Asn 


Asp 


Glu 


Lys 








285 








Thr 


Gin 


Tyr 


Lys 


Thr 


Gin 


Asp 






300 










Leu 


Leu 


Ala 


Lys 


His 


Asn 


lie 




315 










320 


Ser 


Tyr 


Ser 


Tyr 


Tyr 


Glu 


Lys 


330 










335 


Leu 


Gly 


Val 


Leu 


Gin 


Glu 


Asp 










350 




Glu 


Ala 


Phe 


Tyr 


Arg 


He 


Arg 








365 








Ser 


Pro 


Arg 


Gly 


Leu 


Arg 


Thr 






380 










Thr 


Glu 


Thr 


Gly 


Ser 


Phe 


His 




395 










400 


Asn 


Val 


His 


Leu 


Arg 


Ala 


Val 


410 










415 




Gin 


Leu 


Ala 


Lys 


Glu 


Asp 


Gin 










430 




Phe 


Ser 


Asp 


Gly 


Leu 


Arg 


Met 








445 








Pro 


Cys 


Glu 


Leu 


Gin 


Lys 


Glu 






460 








Gly 


Asp 


Phe 


Met 


Cys 


Gly 


His 




475 










480 


Lys 


Thr 


Cys 


Asn 


Cys 


Ser 


Thr 


4 90 










495 




Leu 


Arg 


Glu 


Gly 


Glu 


Asp 


Lys 










510 




Cys 


Gly 


Arg 


Cys 


Val 


Cys 


Tyr 








C-1C 






Cys 


Glu 


Tyr 


Asp 


Asn 


Phe 


Gin 






540 










Asn 


Asp 


Arg 


Gly 


Arg 


Cys 


Ser 




555 










560 


Tro 


Thr 


Gly 


Arg 


Ser 


Cys 


Asp 


570 










575 


Asp 


Ser 


Asn 


Gly 


Gly 


lie 


Cys 



590 
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Asn Gly 


Arg 
5 95 


Gly 


Tyr 


Cys 


Glu 


Cys 
600 


Gly 


Arg 


Cys 


His 


Cys 
605 


Asn 


Gin 


Gin 


Ser Leu 


Tyr Thr 


Asp* 


Thr 


Thr 


Cys 


Glu 


lie 


Asn 


Tyr 


Ser 


Ala 


He 


Leu 


610 










615 










620 










Gly Leu 


Cys 


Glu 


Asp 


Leu 


Arg 


Ser 


Cys 


Val 


Gin 


Cys 


Gin 


Ala 


Trp 


Gly 


625 








630 










635 










640 


Thr Gly 


Glu 


Lys 


Lys 
645 


Gly 


Arg 


Ala 


Cys 


Asp 
650 


Asp 


Cys 


Pro 


Phe 


Lys 
655 


Val 


Lys Met 


Val 


Asp 
660 


Glu 


Leu 


Lys 


Lys 


Glu 

OOJ 


Glu 


Val 


Val 


Glu 


Tyr 

C "7 A 


Cys 


Ser 


Phe Arg 


Asp 


Glu 


Asp 


Asp 


Asp Cys 


Thr 


Tyr 


Ser 


Tyr 


Asn 


Val 


Glu 


Gly 




675 










680 










685 








Asp Gly 


Ser 


Pro 


Gly 


Pro 


Asn 


Ser 


Thr 


Val 


Leu 


Val 


His 


Lys 


Lys 


Lys 


690 










695 










700 










Asp Cys 


Leu 


Pro 


Ala 


Pro 


Ser 


Trp 


Trp 


Leu 


He 


Pro 


Leu 


Leu 


He 


Phe 


705 








710 










715 










720 


Leu Leu 


Leu 


Leu 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu 


Leu 


Cys 


Trp 


Lys 







725 730 



<210> 11 
<211> 3579 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note - 
synthetic construct 

<400> 11 

atgcctgcgc tctggctcag ctgctgcctc ggtgtcgcgc tcctgctgcc cgccagccag 60 

gccacctcca ggagggaagt ctgtgattgc aatgggaagt ccaggcaatg tgtctttgat 120 

caggagctcc atcgacaagc aggcagcggg ttccgttgcc tcaactgcaa tgacaataca 180 

gcgggggttc actgcgagcg gtcgagggag gggttttacc agcatcagag caagagccgc 240 

tgcctaccct gcaactgcca ctcaaagggt tccctcagtg ctggatgtga caactctgga 300 

caatgcaggt gtaagccagg t.gtgacagga caaagatgtg accagtgtca gccaggcttc 360 

catatgctca ccgatgctgg atgcacccga gaccaggggc aactagattc caagtgtgac 420 

tgtgacccag ctggcatctc tggaccctgt gattctggcc gatgtgtctg caaaccagcc 4 80 

gtcactggag agcgctgtga taggtgccga ccacgtgact atcatctgga ccgggcaaac 540 

cctgagggct gtacccagtg tttctgctat gggcattcag ccagctgcca cgcctctgcc 600 

gacttcagtg tccacaaaat cacttcaact ttcagtcagg atgtggatgg ttggaaggcg 660 

gttcagagaa acggggcacc tgcaaaactc cactggtcac agcgccatcg ggacgtgttt 720 

agttctgccc gaagatcaga ccccgtcfcat ttcgtggccc ctgccaaatt cctcggtaac 780 

cagcaagtga gttacgggca gagcctgtct tttgactacc gcgtggacag aggaggtaga 940 

cagccgtctg cctacgatgt gatcctggaa ggtgctggtc tacagatcag agctcctctg 900 

atggctccag gcaagacacfc tccttgtggg atcacaaaga cttacacatt cagactgaat 960 

gaacatccaa gcagtcactg gagtccccag ctgagttatt tcgaatatcg aaggttactg 1020 

cggaacctca cagccctcct gatgatccga gctacgtacg gagaatatag tacagggtac 1030 

attgataacg tgaccctggt ttcagcccgc cctgtccttg gagccccagc cccttgggtt 1140 

gaacgttgtg tatgccfcgct ggggtacaag ggacaattct gccaggaatg tgcttctggt 1200 

cacaaaagag attcggcaag attgggcgct tttggcgcct gtgttccctg taactgccaa 1260 

ggggaggggg cctgtgatcc agacacggga gattgctact cgggggacga gaatcctgac 1320 

attgagtgtg ctgactgtcc catcggtttc tacaatgacc cacatgaccc ccgcagctgc 1380 

aagccatgtc cctgtcacaa tgggttcagc fcgttcagtga tgcctgagac agaggaggtg 1440 

gtgtgtaaca actgtccccc tggggtcaca ggtgcccgct gtgagctctg tgctgatggc 1500 

ttctttgggg atccctttgg ggaacatggc ccagtgaggc cttgtcaacg ctgccaatgc 1560 

aacaacaacg tggaccccaa tgcctctggg aacfcgtgacc agttgacagg cagatgcttg 1620 

aaatgtatct acaacacggc cggtgtctac tgtgaccagt gcaaagcagg ttactttgga 1680 

gacccattgg ctcccaaccc agcagacaag tgtcgagcfcfc gcaactgcag ccccatgggt 1740 

gcggagcctg gagagtgtcg aggtgafcggc agctgtgttt gcaagccagg ctttggcgcc 1800 

ttcaactgtg atcacgcagc cctaaccagt tgtcctgctt gctacaatca agtgaagatt i860 

cagatggacc agtttaccca gcagctccag agcctggagg ccctggtttc aaaggctcag 1920 

ggtggtggtg gtggtggtac agtcccagtg cagctggaag gcaggatcga gcaggctgag 1930 



WO 02/30465 



PCT/US01/32127 



caggccctbc 
ccbgtccggc 
ctcaagabga 
gafcacgagca 
gaaaacacta 
gctcaggagg 
ctagcaaggg 
agtggaggag 
ttagagaaaa 
gaagcfcgata 
ggagtcagtg 
ctctcaaacc 
aactgggaaa 
gatcagctgc 
ggcaatgcca 
caggbtgaag 
cagaaggttg 
actgccgaca 
atagagctgg 
gccatggaga 
agaaaggagc 
cagcaagcbg 
ttggacggca 
ctattagaac 
atgtctgacc 
agcatagatg 
ccaggctgct 



aggacattct 
tggccaaggc 
ctgcagaaag 
gacbcatcbc 
atatccat be 
ctacaagaaa 
aaactgagga 
gcggaagtgg 
ccaagtcccb 
ggtegtatea 
atctgtcctt 
tggtgaccag 
aagaaacacg 
tttcccgtgc 
ctttttatga 
acagaaaagc 
eggatgecag 
cccaacgggc 
agatagggag 
aagggactgc 
tggagtttga 
atgecagage 
tcctacacct 
aagggctttt 
tggaggagag 
gaattcttgc 
acaataccca 



gggagaagct 
aaggagccaa 
gatccgggcc 
teagatgege 
ttcbgagcac 
ggcagacagc 
ctactccaaa 
ctcttgggac 
gagecagcag 
gcacagtctc 
tcaggtggaa 
acaaaeggat 
gcagctbtta 
caaccttgct 
agttgagaac 
agaggctgaa 
fcgacaagacc 
aaagaacgea 
tctgaacttg 
cactctgaag 
caeggataag 
cacgagtgcc 
catagaccag 
ccaagccaag 
ggtgcgtcgg 
tgatgtgaag 
agctcttgag 



22 

cagatbtcag 
cagaacgacb 
cfcgggcagtc 
cbgagbcbgg 
tacgtggggc 
caegctgagb 
caagcacttt 
agetcegbgg 
ctgtcattgg 
cgccbccbgg 
gcaaagagga 
gcabtcacgc 
cagactgg'aa 
aaaaacagag 
atccbgaaga 
gaggecabga 
cagcaagcag 
gctagggagg 
gaagctaatg 
agcgagafcga 
gacacggtgc 
ggagttacca 
cctggcagtg 
acccagatca 
cagaggaacc 
aacctggaga 
caacagtga 



aaggggcaab 
acaagacccg 
agcabcagaa 
caggaagega 
cgaatgatbt 
cagctaacgc 
cattggcccg 
tacaaggtcb 
agggcaccca 
at bcbgcctc 
tcagacaaaa 
gtgtgcgaaa 
aggataggag 
cccaagaagc 
acctccgaga 
agagactctc 
aaacggcccb 
ccctggagafc 
tgacagcaga 
gagagatgab 
agctggtgab 
tccaagacac 
tggatgaaga 
acagtcgacb 
acctccatct 
acabtcgaga 



gagagccgfct 
cctggatgac 
cagagttcag 
agebctcttg 
taaaagtctg 
aabgaagcaa 
caagctebbg 
tatgggaaaa 
agecgacatt 
tcagcttcag 
ggctgattcb 
caatctgggg 
acagacbtca 
gctaagbatg 
gtttgabctg 
ctcbatbagc 
ggggagegee 
cagcagegag 
tggggccttg 
bgagctggcc 
bactgaagee 
rctcaacaca 
agggatgabg 
tcggcccbbg 
gctggagacb 
caacctgccc 



<210> 12 
<211> 1192 
<212> PRT 

<213> Arbificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note 
synthebic construct 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2380 
2940 
3000 
3060 
3120 
3130 
3240 
3300 
3360 
3420 
3480 
3540 
3579 



<400> 12 


















Met Pro 


Ala 


Leu 


Trp 


Leu 


Ser 


Cys Cys 


Leu 


Gly 


Val Ala Leu Leu Leu 


1 






5 






10 




15 


Pro Ala 


Ser 


Gin 


Ala 


Thr 


Ser 


Arg Arg 


Glu 


Val 


Cys Asp Cys Asn Gly 






20 








25 






30 


Lys Ser 


Arg 


Gin 


Cys 


Val 


Phe 


Asp Gin 


Glu 


Leu 


His Arg Gin Ala Gly 




35 










40 






45 


Ser Giy 


Phe 


Arg 


Cys 


Leu 


Asn 


Cys Asn 


Asp 


Asn 


Thr Ala Gly Val His 


50 








55 








60 


Cys Glu 


Arg 


Ser 


Arg 


Glu 


Gly 


Phe Tyr 


Gin 


His 


Gin Ser Lys Ser Arg 


65 








70 








75 


80 


Cys Leu 


Pro 


Cys 


Asn 


Cys 


His 


Ser Lys 


Gly 


Ser 


Leu Ser Ala Gly Cys 








85 








90 




95 


Asp Asn 


Ser 


Gly 


Gin 


Cys 


Arg 


Cys Lys 


Pro 


Gly 


Val Thr Gly Gin Arg 






100 








105 






110 


Cys Asp 


Gin 


Cys 


Gin 


Pro 


Gly 


Phe His 


Met 


Leu 


Thr Asp Ala Gly Cys 




115 










120 






125 


Thr Arg 


Asp 


Gin 


Gly 


Gin 


Leu 


Asp Ser 


Lys 


Cys 


Asp Cys Asp Pro Ala 


130 










135 








140 


Gly lie 


Ser 


Gly 


Pro 


Cys 


Asp 


Ser Gly 


Arg 


Cys 


Val Cys Lys Pro Ala 


145 








150 








155 


160 


Val Thr 


Gly 


Glu 


Arg 


Cys 


Asp 


Arg Cys 


Arg 


Pro 


Arg Asp Tyr His Leu 








165 








170 




175 


Asp Arg 


Ala 


Asn 


Pro 


Glu 


Gly 


Cys Thr 


Gin 


Cys 


Phe Cys Tyr Gly His 






180 








185 






190 
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Ser 


Ala 


Ser 


Cys 






195 






Thr 

J, 1 IX 


Phe 


Ser 




1 i n 

C J. V 






p 1 
oxy 


MIS 


Pro 












Cp r 


oer 


ai a 

nia 


rtxy 


Phta 

- i 1 c 


Leu 


bi y 


Asn 








260 


Tyr 


Arg 


Val 


Asp 






9 7 5 




iJC U 


pi n 


PI u 

oiy 


Aia 










.by s 


inr 


Leu 


Pro 










P1 i- 


MIS 


Pro 


C <a-v 

oer 


Arg 


Arg 


Leu 


T All 

Leu 








340 


Tyr 


Gly 


Glu 


Tyr 










Aia 


Arg 


Pro 


v ai 




■D f \J 






p w e? 
uy s 


Leu 


Leu 


biy 


JO J 








Tyr 


Lys 


Arg 


Asp 


Cys 


Asn 


Cys 


bin 








420 


Tyr 


Ser 


Gly 


Asp 










P 1 \r 

biy 


me 


Tyr 


Asn 




4 Jv 






^ys 


nis 


Asn 


biy 










va j. 


Cys 


Asn 


Asn 


Cys 


A 1 -a 

ax a 


Asp 


biy 








500 


Arg 


Pro 


Cys 


Gin 






D Id 




Ser 


lily 


t\ _ _ 

Asn 


Cys 




D J>U 






Asn 


i nr 


Ala 


biy 










Asp 


r r 0 


Leu 


Ala 


Ser 


rlO 


we u 


biy 








580 


Val 


Cys 


Lys 


Pro 










i nr 


Ser 


Cys 


fro 




610 






?he 


Thr 


Gin 


Gin 


625 








Gly 


Gly 


Gly 


Gly 


Glu 


Gin 


Ala 


Glu 








660 


Ser 


Glu 


Gly 


Ala 



675 



n x o 


HJLa 




Ala 








200 


Gin 


Asp 


Val 


Asp 






C. J. J 




jjy o 


Leu 


nl5 


xrp 




& JU 






a r* rr 


oer 


Asp 


r jl u 


245 










PI n 


\/a 1 
v ai 


C #3 v- 


Arg 


PI \/ 
biy 


PI \r 

biy 


Arg 








280 


Gly 


Leu 


Gin 


He 










Cys 


Pi u 

biy 


lie 


i nr 




JlU 






Car 

oer 


HIS 


Trp 


Ser 


O -J 








Arg 


Asn 


Leu 


mr 


Ser 


inr 


biy 


Tyr 








360 


Leu 


Gly 


Ala 


Pro 






?7 R 




Tyr 


Lys 


biy 


bin 










Ser 


Aia 


Arg 


Leu 


4U J 








biy 


p i i« 
blU 


pi t » 
biy 


Ala 


pi 

blU 


7\ e» *n 

Asn 


Pro 


TV c— » 

ASp 








440 


Asp 


Pro 


His 


Asp 






*i J J 




rne 


Ser 


Cys 


Ser 








Cys 


Pro 


Pro 


PI K 

biy 


4 0 D 








fne 


rne 


P 1 f r 

Gly 


ASp 


Arg 


Cys 


Gin 


Cys 








520 


Asp 


Gin 


Leu 


Thr 






j j j 




val 


Tyr 


Cys 


ASp 










rro 


Asn 


Pro 


Aia 


JUJ 








7\ 1 a 

Aia 


pi i % 

blU 


rro 


biy 


biy 


rne 


Pi 1/ 

biy 


Ala 








600 


Ala 


Cys 


Tyr 


Asn 






ci ^ 
oio 




Leu 


bin 


Ser 


Leu 




630 






Gly 


Gly 


Thr 


Val 


64 5 








Gin 


Ala 


Leu 


Gin 


Met 


Arg 


Ala 


Val 








680 



23 



ASD 




oer 


vdl 


Gly 


irp 


Lys 


Ala 








220 


Ser 


Gin 


Arg 


His 






2 3 R 




v ci J. 


iyr 




Ua 1 
Veil 




250 






Tur 

iyr 


PI V 


Pin 


oer 


£ U _) 








Pi n 


, x O 


oer 


Aia 


Arg 


Ha 
nlfl 


rro 


Leu 








300 


Lys 


Thr 


Tyr 


Thr 






J 10 




Pro 


pi « 
bin 


Leu 


Ser 










Aia 


Leu 


Leu 


wet 


J 








lie 


Asp 


Asn 


\/a 1 

vai 


Aia 


Pro 


Trp 


vai 








380 


Phe 


Cys 


Gin 


Glu 






~> j j 




P1 \r 

vjiy 


Aia 


rne 


PI xr 

biy 




a i n 






Cys 


Asp 


Pro 


ASp 


4 9 ^ 








lie 


pin 

blU 


Cys 


Ala 


Pro 


Arg 


oer 


Cys 








460 


Val 


Met 


Pro 


Glu 






4 7 R 




Va 1 

vai 


inr 


PI u 

biy 


Aia 




^ Qn 






Pro 


rne 


P 1 v# 

biy 


pi M 

blU 


^n r 






Asn 


Asn 


Asn 


val 


biy 


Arg 


Cys 


Leu 








540 


Gin 


Cys 


Lys 


Ala 






J J J 




Asp 


Lys 


Cys 


Arg 










PI n 


Cys 


Arg 


pi \t 
biy 


JO J 








Pho 

cne 


Asn 


Cys 


Asp 


Pin 

bin 


\/a 1 

vai 


Lys 


lie 








620 


Glu 


Ala 


Leu 


Val 






635 




Pro 


Val 


Gin 


Leu 




650 






Asp 


He 


Leu 


Gly 


665 






Ala 


Val 


Arg 


Leu 



His 


Lys 


Tl a 
lie 


Th r 


205 








Val 


VJ X i 1 


a m 


a c n 
noil 


a to 

rtx y 




\/a 1 
v a i 


rne 








240 


Ala 


Pro 


Ala 


Lys 






2 5 5 


T> A 11 


Cpr 


rne 


Asp 




270 




iyr 


Asp 


vai 


lie 


to J 








tie t 


211 a 
Mid 


Pro 


p 1 1/ 
biy 


rne 


Arg 


Leu 


Asn 








320 


Tyr 


Phe 


Glu 


Tyr 










lie 


Arg 


aia 
Ala 


Th v 

i nr 




•sen 






i nr 


Leu 


vai 


Ser 


JUJ 








PI 11 
blU 


Arg 


Cys 


vai 


Cys 


ai a 
Aia 


oer 


pi «f 
biy 








400 


Ala 


Cys 


Val 


Pro 






/lie; 




i nr 


PI \r 

biy 


Asp 


Cys 




4 ^n 

O U 






Asp 


Cys 


rro 


Tl a 

lie 


*1 H D 








Lys 


Pro 


Cys 


Pro 


i nr 


pi n 

blU 


p i ii 

blU 


vai 








480 


Arg 


Cys 


Glu 


Leu 






H i? J 




HIS 


pi \/ 
biy 


rro 


\7a 1 

vai 




J iw 






Asp 


Pro 


Asn 


Aia 


c 9 c 

D £ D 








Lys 


Cys 


T 1 A 

lie 


Tyr 


pi t t 
biy 


Tyr 


rne 


pi u 
biy 








560 


Ala 


Cys 


Asn 


Cys 






ETC 

D f 0 




ASp 


biy 


Ser 


Cys 




590 






HIS 


aia 
Aia 


aia 
Aia 


Leu 










Pin 


Lieu 


ASp 


Pin 
oin 


Ser 


Lys 


Ala 


Gin 






640 


Glu 


Gly 


Arg 


He 






655 




Glu 


Ala 


Gin 


He 




670 






Ala 


Lys 


Ala 


Arg 
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Ser Gin 

*x* C JL w«X* 1 


Glu 


As n 


A 


iyr 


T.w c 
xry o 


Thr 


Arg 


X)C U 


Asp 


Asp 


Leu 


Lys 


cie c 


i nr 


690 










O J J 








7H0 










AT a PI n 


A rn 


rip 


MX y 


nxa 


Leu 


bj- y 


Car 
OCX 


bin 


n IS 


bin 


As n 


Arg 


va i 


P } r\ 

bin 


705 








' X u 










715 








7 o n 


*»okJ X i IX 


OCX 


Arg 


T>ot i 

725 


Tie 


oer 


PI n 
b J_1 1 


Mo f- 


Arg 
7 ?n 


Leu 


Ser 


Leu 


Aia 


biy 

7 ^ ^ 


Ser 


Gin n "I o 


JUtr U 


XJ*r u 


P") n 


As n 


x nr 


As n 


lie 


HIS 


Ser 


oer 


p i ti 

blU 


HIS 


Tyr 


val 






740 










74 5 








7 ^n 




Gl v Prn 


Jlen 
."on 




Php 


xjy o 


O c XT 


Leu 


Ala 


Pi n 


blU 


Aia 


x nr 


Arg 


Lys 


Aia 




755 










7 60 










765 




Asn Spr 




AT a 
mjl d 






AT a 


MS n 


Ala 
MX a 


Met 


Lys 


bin 


Leu 


Aia 


Arg 


blU 


770 










77 ^ 








780 








Thr m n 

1 i IX O JL U 


Mop 


Tyr 


Ser 


Lys 


pin 

bin 


Aia 


i»eu 


Ser 


Leu 


Ala 


Arg 


Lys 


Leu 


Leu 


785 








7 90 










/ yo 






800 


Car ftl \; 




pi w 
bxy 


PI v 

biy 


C ca t- 

oer 


biy 


oer 


Trp 


Asp 


oer 


C n V 

ser 


val 


Val 


Gin 


Gly 


















Qi r\ 
□ 1U 










815 


XJcU L v jcL, 


pi 

b xy 


Lys 

O 4. u 


Leu 




Lys 


i nr 


Lys 
8 9 ^ 

0 c. D 


oer 


Leu 


Ser 


Gin 


Gin 

0 


Leu 


Ser 


T,on P 1 1 1 


bxy 


Thr- 


Pin 

bin 


Mia 


Asp 


lie 


blU 


Aia 


ASp 


Arg 


Ser 


Tyr 


Gin 


His 




835 










840 








845 






uCX. itcu 


2\ v- rt 

Mx y 


Leu 


Leu 


Asp 


Ser 


Aia 


Ser 


bin 


T ^ * i 

Leu 


bin 


Gly 


Val 


Ser 


Asp 


O J \J 










pec 










o bU 






LeU S©x" 


cue 


bin 


va jl 


C 1 n 
blU 


Aia 


Lys 


Arg 


lie 


Arg 


Gin 


Lys 


Ala 


Asp 


Ser 










0 t V 










QIC 

o / b 






880 


xieu oer 


Asn 


Leu 


vai 


inr 


Arg 


Gin 


Tnr 


Asp 


Ala 


Pne 


Thr 


Arg 


Val 


Arg 








8 fl ^ 
0 0 3 










o y u 










895 


noil rtin 


xjeu 


qnn 


Asn 


Trp 


pi ii 

blU 


Lys 


pi ,, 

blU 


inr 


Arg 


Gin 


Leu 


Leu 
910 


Gin 


Thr 


P 1 \/ T> v/ c 
\3j*y u y o 


nil) 


Mx y 


Arg 


Pin 

b j.n 


i nr 


Car* 

oer 


Asp 


bin 


Leu 


Leu 


O n w 

ser 


Arg 


Ala 


Asn 




y J» J 




















925 






Leu Ala 


Lys 


Asn 


Arg 




bin 


blU 


Ala 


Leu 


Ser 


Met 


Gly 


Asn 


Ala 


Thr 


^ J w 










j 










y 4 u 








rne lyr 


p i n 

blU 


va J. 


blU 


Asn 


lie 


Leu 


Lys 


Asn 


Leu 


Arg 


Glu 


Phe 


Asp 


Leu 










q c n 

you 










955 






960 


Gin Val 


Glu 


Asp 


Arg 


Lys 


A^a 


Glu 


Ala 


Glu 


Glu 


Ala 


Met 


Lys 


Arg 


Leu 








y OD 










Q *7 A 








975 




Ser Ser 


lie 


Car 


bin 


Lys 


Val 


Ala 


Asp 


Ala 


Ser 


Asp 


Lys 


Thr 


Gin 


Gin 






980 










985 






990 






Ala Glu 


Thr 


Ala 


Leu 


Gly 


Ser 


Ala 


Thr 


Ala 


Asp 


Thr 


Gin 


Arg 


Ala 


Lys 




995 










1000 






1005 




Asn Ala 


Ala 


Arg 


Glu 


Ala 


Leu 


Glu 


He 


Ser 


Ser 


Glu 


He 


Glu 


Leu 


Glu 


1010 








1015 








1020 








He Gly 


Ser 


Leu 


Asn 


Leu 


Glu 


Ala 


Asn 


Val 


Thr 


Ala Asp Gly 


Ala 


Leu 


1025 








1030 








1035 








1040 


Ala Met 


Glu 


Lys 


Gly 


Thr 


Ala 


Thr 


Leu 


Lys 


Ser Glu Met Arg 


Glu 


Met 



1045 1050 1055 



He Glu Leu Ala Arg Lys Glu Leu Glu Phe Asp Thr Asp Lys Asp Thr 

1060 1065 1070 

val Gin Leu Val He Thr Glu Ala Gin Gin Ala Asp Ala Arg Ala Thr 

1075 1080 1085 

Ser Ala Gly Val Thr He Gin Asp Thr Leu Asn Thr Leu Asp Gly He 

1090 1095 1100 

Leu His Leu He Asp Gin Pro Gly Ser Val Asp Glu Glu Gly Met Met 
1105 mo ins * 1120 

Leu Leu Glu Gin Gly Leu Phe Gin Ala Lys Thr Gin He Asn Ser Arg 

1125 H30 1135 

Leu Arg Pro Leu Met Ser Asp Leu Glu Glu Arg Val Arg Arg Gin Arg 

1140 1145 1150 

Asn His Leu His Leu Leu Glu Thr Ser He Asp Gly He Leu Ala Asp 

1155 1160 ~ H65 

Val Lys Asn Leu Glu Asn He Arg Asd Asn Leu Pro Pro Gly Cys Tyr 
1170 1175 ~ H80 
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Asn Thr Gin Ala Leu Glu Gin Gin 



<210> 13 
<211> 3582 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 13 

atgcctgcgc tctggctggg ctgctgcctc tgcttctcgc tcctcctgcc cgcagcccgg 60 

gccacctcca ggagggaagt ctgtgattgc aatgggaagt ccaggcagtg tatctttgat 120 

cgggaacttc acagacaaac tggtaatgga ttccgctgcc tcaactgcaa tgacaacact 180 

gatggcattc actgcgagaa gtgcaagaat ggcttttacc ggcacagaga aagggaccgc 240 

tgtttgccct gcaattgtaa ctccaaaggt tctcttagtg ctcgatgtga caactctgga 300 

cggtgcagct gtaaaccagg tgtgacagga gccagatgcg accgatgtct gccaggcttc 360 

cacatgctca cggatgcggg gtgcacccaa gaccagagac tgctagactc caagtgtgac 420 

tgtgacccag ctggcatcgc agggccctgt gacgcgggcc gctgtgtctg caagccagct 480 

gttactggag aacgctgtga taggtgtcga tcaggttact ataatctgga tggggggaac 540 

cctgagggct gtacccagtg tttctgctat gggcattcag ccagctgccg cagctctgca 600 

gaatacagtg tccataagat cacctctacc tttcatcaag atgttgatgg ctggaaggct 660 

gtccaacgaa atgggtctcc tgcaaagctc caatggtcac agcgccatca agatgtgttt 720 

agctcagccc aacgactaga ccctgtctat tttgtggctc ctgccaaatt tcttgggaat 780 

caacaggtga gctatgggca aagcctgtcc tttgactacc gtgtggacag aggaggcaga 840 

cacccatctg cccatgatgt gattctggaa ggtgctggtc tacggatcac agctcccttg 900 

atgccacttg gcaagacact gccttgtggg ctcaccaaga cttacacatt caggttaaat 960 

gagcatccaa gcaataattg gagcccccag ctgagttact ttgagtatcg aaggttactg 1020 

cggaatctca cagccctccg catccgagct acatatggag aatacagtac tgggtacatt 1080 

gacaatgtga ccctgatttc agcccgccct gtctctggag ccccagcacc ctgggttoaa 1140 

cagtgtatat gtcctgttgg gtacaagggg caattctgcc aggattgtgc ttctggctac 1200 

aagagagatt cagcgagact ggggcctttt ggcacctgta ttccttgtaa ctgtcaaggg 1260 

ggaggggcct gtgatccaga cacaggagat tgttattcag gggatgagaa tcctgacatt 1320 

gagtgtgctg actgcccaat tggtttctac aacgatccgc acgacccccg cagctgcaag 1380 

ccatgtccct gtcataacgg gttcagctgc tcagtgatgc cggagacgga ggaggtggtg 14 4 0 

tgcaataact gccctcccgg ggtcaccggt gcccgctgtg agctctgtgc tgatggctac 1500 

tttggggacc cctttggtga acatggccca gtgaggcctt gtcagccctg tcaatgcaac 1560 

aacaatgtgg accccagtgc ctctgggaat tgtgaccggc tgacaggcag gtgtttgaag 1620 

tgtatccaca acacagccgg catctactgc gaccagtgca aagcaggcta cttcggggac 1630 

ccattggctc ccaacccagc agacaagtgt cgagcttgca actgtaaccc catgggctca 17*0 

gagcctgtag gatgtcgaag tgatggcacc tgtgtttgca agccaggatt tggtggcccc 1800 

aactgtgagc atggagcatt cagctgtcca gcttgctata atcaagtgaa gattcagatg -860 

gatcagttta tgcagcagci: fccagagaatg gaggccctga tttcaaaggc tcagggtggt 1920 

gatggagtag tacctgatac agagctggaa ggcaggatgc agcaggctga gcaggccctt 1980 

caggacattc tgagagatgc ccagatttca gaaggtgcta gcagatccct tggtctccag 204 0 

ttggccaagg tgaggagcca agagaacagc taccagagcc gcctggatga cctcaagatg 2100 

actgtggaaa gagttcgggc tctgggaagt cagtaccaga accgagttcg ggatactcac 2160 

aggctcatca ctcagatgca gctgagcctg gcagaaagtg aagcttcctt gggaaacact 2220 

aacartcctg cctcagacca ctacgtgggg ccaaatggct ttaaaagtct ggctcaggag 2280 

gccacaagat tagcagaaag ccacgttgag tcagccagta acatggagca actgacaagg 2340 

gaaactgagg acfcatfcccaa acaagccctc tcactggtgc gcaaggccct gcatgaagga 2400 

gtcggaagcg gaagcggtag cccggacggt gctgtggtgc aagggcttgt ggaaaaattg 24 60 

gagaaaacca agtccctggc ccagcagttg acaagggagg ccactcaagc ggaaattgaa 2520 

gcagataggt cttatcagca cagtctccgc ctcctggatt cagtgtctcg gcttcaggga 2580 

gfccagtgatc agtcctttca ggtggaagaa gcaaagagga tcaaacaaaa agcggattca 2640 

ctcccaacgc tggtaaccag gcafcatggat gagtfccaagc gtacacaaaa gaatctggga 2700 

aac-ggaaag aagaagcaca gcagctctta cagaatggaa aaagtgggag agagaaatca 2760 

gatcagctgc tttcccgtgc caatcttgcfc aaaagcagag cacaagaagc actgagtatg 2820 

ggcaatgcca ctfctttatga agttgagagc atccttaaaa acctcagaga gtttgacctg 2830 

caggtggaca acagaaaagc agaagctgaa gaagccatga agagactctc ctacatcagc 2940 
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cagaaggttfc cagatgccag tgacaagacc cagcaagcag aaagagccct ggggagcgct 3000 

gctgctgafcg cacagagggc aaagaafcggg gccggggagg ccctggaaat ccccagtgag 3060 

attgaacagg agattgggag tcfcgaactcg gaagccaatg tgacagcaga tggagccttg 3120 

gccatggaaa agggactggc ctctctgaag agtgagatga gggaagtgga aggagagctg 3180 

gaaaggaagg agctggagtt hgacacgaafc atggatgcag tacagatggt gattacagaa 3240 

gcccagaagg ttgataccag agccaagaac gctggggtta caatccaaga cacactcaac 3300 

acatbagacg gcctcctgca tctgatggac cagcctctca gtgtagatga agaggggctg 3360 

gtcttactgg agcagaagct ttcccgagcc aagacccaga tcaacagcca actgcggccc 3420 

atgatgtcag agctggaaga gagggcacgt cagcagaggg gccacctcca tttgctggag 3480 

acaagcatag atgggattct ggctgahgtg aagaacttgg agaacattag ggacaacctg 3540 

cccccaggct gctacaatac ccaggctctt gagcaacagt ga ^ ~ 3582 

<210> 14 
<211> 1193 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 14 
























Mia H 

Me c 


Pro Ala 


Leu 


Trp 


Leu 


Gly 


Cys 


Cys 


Leu 


Cys 


Phe 


Ser Leu 


Leu Leu 


1 








r 

3 










10 








15 


rlO 


Ala 


Ala 


Arg 


Ala 


Thr 


Ser 


Arg 


Arg 
25 


Glu 


Val 


Cys 


Asp Cys 
30 


Asn Gly 


Tin-* 

Lys 


Ser 


Arg 


Gin 


Cys 


He 


Fne 


Asp 


Arg 


Glu 


Leu 


His 


Arg Gin 


Thr Gly 






35 










40 










45 


Asn 


Gly 


Phe 


Arg 


Cys 


Leu 


Asn 


Cys 


Asn 


Asp 


Asn 


Thr 


Asp Gly 


He His 




50 










55 










60 




Cys 


Glu 


Lys 


Cys 


Lys 


Asn 


Gly 


Phe 


Tyr 


Arg 


His 


Arg 


Glu Arg 


Aso Arg 


£ C 
03 










10 










75 




80 


Cys 


Leu 


Pro 


Cys 


Asn 

03 


Cys 


Asn 


Ser 


Lys 


Gly 
90 


Ser 


Leu 


Ser Ala 


Arg Cys 
95 


Asp 


Asn 


Ser 


Gly 


Arg 


Cys 


Ser 


Cys 


Lys 


Pro 


Gly 


Val 


Thr Gly Ala Arg 








100 








105 








110 




Cys 


Asp Arg 


Cys 


Leu 


Pro 


Gly 


Phe 


His 


Met 


Leu 


Thr 


Asp Ala 


Gly Cys 






115 










120 










125 


Thr 


Gin 


Asp 


Gin 


Arg 


Leu 


Leu 


Asp 


Ser 


Lys 


Cys 


Asp 


Cys Asp 


Pro Ala 




130 










135 










140 




Gly 


He 


Ala 


Gly 


Pro 


Cys 


Asp 


Ala 


Gly 


Arg 


Cys 


val 


Cys Lys 


Pro Ala 


145 










150 










155 






160 


Val 


Thr Gly 


Glu 


Arg 


Cys 


Asp 


Arg 


Cys 


Arg 


Ser 


Gly 


Tyr Tyr Asn Leu 










165 










170 








175 


Asp 


Gly Gly 


Asn 


Pro 


Glu 


Gly 


Cys 


Thr 


Gin 


Cys 


Phe 


Cys Tyr Gly His 








180 










185 








190 




Ser 


Ala 


Ser 


Cys 


Arg 


Ser 


Ser 


Ala 


Glu 


Tyr 


Ser 


Val 


His Lys 


He Thr 






195 










200 








205 




Ser 


Thr 


Phe 


His 


Gin 


Asp 


Val 


Asp 


Gly 


Trp 


Lys 


Ala 


Val Gin Arg Asn 




210 










215 






220 






Gly 


Ser 


Pro 


Ala 


Lys 


Leu 


Gin 


Trp 


Ser 


Gin 


Arg 


His 


Gin Asp Val Phe 


225 










230 










235 






240 


Ser 


Ser 


Ala 


Gin 


Arg 
245 


Leu 


Asp 


Pro 


Val 


Tyr 
250 


Phe 


Val 


Ala Pro 


Ala Lys 
255 


Phe 


Leu 


Gly 


Asn 


Gin 


Gin 


Val 


Ser 


Tyr 
265 


Gly 


Gin 


Ser 


Leu Ser 


Phe Asp 








260 
















270 


Tyr 


Arg 


Val 
275 


Asp 


Arg 


Gly 


Gly 


Arg 
280 


His 


Pro 


Ser 


Ala 


His Asp 
285 


Val He 


Leu 


Glu 


Gly 


Ala 


Gly 


Leu 


Arg 


He 


Thr 


Ala 


Pro 


Leu 


Met Pro 


Leu Gly 




290 










295 










300 




Lys 


Thr 


Leu 


Pro 


Cys 


Gly 


Leu 


Thr 


Lys 


Thr 


Tyr 


Thr 


Phe Arg 


Leu Asn 


305 










310 










315 




320 
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Glu 


His 


Pro 


Ser 


Asn Asn 


T rn 


Ser 


Pro 










325 








Arg Arg 


Leu 


Leu 


Arg Asn 


Leu 


Thr 


Ala 








340 








345 


Gly Glu 


Tyr 


Ser 


Thr Gly 


Tyr 


He 


7\cn 






355 








360 




Arg 


Pro 


Val 


Ser Gly Ala 


Pro 


Ala 




370 








375 






Pro 


Val 


Gly Tyr Lys Gly Gin 


Phe 




385 








390 








Lys 


Arg 


Asp 


Ser 


Ala Arg 


Leu Gly 


Pro 










405 








Asn 


Cys 


Gin Gly Gly Gly Ala 


Cys 


Asp 








420 








4 1 R 


Ser Gly 


Asp 


Glu 


Asn Pro 


Asp 


He 


vjJLU 






435 








440 




Phe 


Tyr 


Asn 


Asp 


Pro His 


Asp 


Pro 


Arg 




450 








455 






His 


Asn 


Gly 


Phe 


Ser Cys 


Ser 


Val 


Met 


465 






470 








Cys 


Asn 


Asn 


Cys 


Pro Pro 


Gly Val 


Tnr 










485 








Ala 


Asp 


Gly Tyr 


Phe Gly Asp 


Pro 


fne 








500 








r n c 
DUO 


Pro 


Cys 


Gin 


Pro 


Cys Gin 


Cys 


Asn 


Asn 






515 








520 




Gly Asn 


Cys 


Asp 


Arg Leu 


Thr 


Gly 


Arg 




530 








535 






Thr 


Ala 


Gly 


He 


Tyr Cys 


Asp 


Gin 


Cys 


545 








550 








Pro 


Leu 


Ala 


Pro 


Asn Pro Ala Asp 


Lys 










565 








Pro 


Met 


Gly 


Ser 


Glu Pro 


Val 


Gly 


Cys 








580 








cat 


Cys 


Lys 


Pro Gly Phe Gly Gly 


Pro 


7\ ^ 

Asn 






595 








600 




Cys 


Pro 


Ala 


Cys 


Tyr Asn 


Gin 


Val 


Lys 




610 








615 






Gin 


Gin 


Leu Gin Arg Met 


Glu 


Ala 


iteu 


625 








630 








Asp 


Gly 


Val 


Val 


Pro Asp 


Thr 


Glu 


Leu 










645 








Glu 


Gin 


Ala 


Leu 


Gin Asp 


lie 


Leu 


Arg 








660 








a c c; 
000 


Ala 


Ser 


Arg 


Ser Leu Gly 


Leu 


Gin 


Leu 






67 5 








680 




Asn 


Ser 


Tyr Gin Ser Arg 


Leu Asp 


Asp 




690 








695 






Val Arg 


Ala 


Leu Gly Ser 


Gin Tyr 


Gin 


705 








710 








Arg 


Leu 


lie 


Thr 


Gin Met 


Gin 


Leu 


Ser 










725 








Leu 


Gly 


Asn 


Thr 


Asn He 


Pro 


Ala 


Ser 








740 








745 


Gly 


Phe 


Lys 


Ser 


Leu Ala 


Gin 


Glu 


Ala 






755 








760 




Val 


Glu 


Ser 


Ala 


Ser Asn 


Met 


Glu 


Gin 




770 








775 






Tyr 


Ser 


Lys 


Gin 


Ala Leu 


Ser 


Leu 


val 


785 








790 








val Gly 


Ser Gly 


Ser Gly 


Ser 


Pro 


Asp 



805 
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Gin Tipn 


OCJl 


Tyr 


Phe 
cue 


Glu 


Tvr 


330 








335 




T »P \\ H y n 


Tip 


Arg 


Ala 


Thr 










350 






a c n Va 1 
t\s it v dl 


Thr 


Leu 


Hp 

-I. JL C 


Ser 


x d 






365 








lip vdl 


UlU 




Pi;e 


lie 






J O v 










bin fi5p 


k^ys 




Q o r* 


ijj.y 


T \/ y* 

iyr 


-3 J J 










4 nn 


fne uiy 


Thr 

i nr 


Cys 


-Lie 


irro 


Cys 


4 10 
4 1U 








4 1J 




CIO Asp 


inr 


v: j.y 


Asp 


Cys 


Tyr 








430 

1 «J V 






Cys AJLa 


Asp 


Cys 


Pro 


i±e 


fti \/ 
oiy 






4 4^ 
4 4 J 








Ser Cys 


Lys 


Pro 


Cys 


Pro 


Cys 




460 










Pro Glu 


inr 


blU 


U1U 


vax 


vai 


4/0 










4 p n 

4 0 u 


c?iy Ala 


Arg 


Cys 


blU 




Cys 


4 71/ 








4 J J 




bxy biu 


nlS 


biy 


Pro 


vai 


Arg 








n 

jIw 






Asn vai 


Asp 


Pro 


Ser 


Aia 


Ser 




cot; 

O £ 0 








uys i»eu 


Lys 


Cys 


ixe 


U ■! e 

MIS 


Asn 




04 U 










Lys Ala 




Tyr 


rne 


k?jLy 


Asp 


^ c: 
000 










DDU 


uys Arg 


Hid 


cys 


As n 


Cys 


Asn 


O / V 








^7 ^ 

0 / O 




Arg Ser 


Asp 


biy 


i nr 


Cys 


vai 








o y u 






Cys Glu 


HlS 


Gly 


Ala 


rne 


Ser 






QUO 








lie Gin 


Met 


ASp 


Gin 


Fne 


wee 




con 
o^u 










lie oer 


Lys 


Aia 


vjin 


uiy 




635 










f"4 fi 
O 4 U 


PI n flu 


Arg 


we t. 


win 


uin 


Axa 


OO v 








U J J 




Asp Ala 


Gin 


lie 


Ser 


ii 


biy 






o / u 






Ala Lys 


val 


Arg 


oer 


bin 


vjIU 






OO 0 








Leu Lys 


Met 


Thr 


val 


GlU 


Arg 


f UU 










Asn Arg 


Val 


Arg 


ASp 


Tk -K- 

xnr 


MIS 


TIC 

' 10 










f £.\J 


Leu Ala 


GlU 


Ser 


Pi ii 

Vj±U 


nld 


oer 


' J u 








71^ 




Asp His 


Tyr 


vai 


o-Ly 


Pro 


Asn 






7 ^n 

' O v 






f K V T\ y- ft 

inr Axg 


Leu 


Ala 


OJL U 


Car 






765 








Leu Thr 


Arg 


Glu 


Thr 


Glu 


Asp 




780 










Arg Lys 


Ala 


Leu 


His 


Glu 


Gly 


795 










800 


Gly Ala 


Val 


Val 


Gin 


Gly 


Leu 


310 








815 





WO 02/30465 



PCT/US01/32127 



28 



Val 


Glu 


Lys 


Leu 
820 


Glu 


Lys 


Thr 


Lys 


Ser 
825 


Leu 


Ala 


Gin 


Gin 


Leu 
830 


Thr 


Arg 


Glu 


Ala 


Thr 


Gin 


Ala 


Glu 


He 


Glu 


Ala 


Asp 


Arg 


Ser 


Tyr 


Gin 


His 


Ser 






835 










840 








845 








Leu 


Arc 
850 


Leu 


Leu 


Asp 


Ser 


Val 
855 


Ser 


Arg 


Leu 


Gin 


Gly 
860 


Val 


Ser 


Asp 


Gin 


Ser 


Phe 


Gin 


Val 


Glu 


Glu 


Ala 


Lys 


Arg 


He 


Lys 


Gin 


Lys 


Ala 


Asp 


Ser 


865 










870 










875 










880 


Leu 


Ser 


Thr 


Leu 


Val 
885 


Thr 


Ara 
■** ^ 


His 


Met 


Asp 
890 


Glu 


Phe 


Lys 


Arg 


Thr 
895 


Gin 


Lvs 


Asn 


Leu 


Glv 
900 


Asn 


Tro 


Lvs 


Glu 


Glu 
905 


Ala 


Gin 


Gin 


Leu 


Leu 
910 


Gin 


Asn 


Glv 


Lvs 


Ser 
915 


Glv 


Arg 


Glu 


Lvs 


Ser 
920 


Asd 


Gin 


Leu 


Leu 


Ser 
925 


Arq 


Ala 


Asn 


Leu 


Ala 


Lys 


Ser 


Arg 


Ala 


Gin 


Glu 


Ala 


Leu 


Ser 


Met 


Gly 


Asn 


Ala 


Thr 




930 






935 










940 










Phe 


T vr 


Glu 


Val 


Glu 


Ser 


He 


Leu 


Lys 


Asn 


Leu 


Ara 


Glu 


Phe 


Asd 


Leu 


945 








950 










955 










960 


Gin 


Val 


Asp 


Asn 


Arg 
965 


Lvs 


Ala 


Glu 


Ala 


Glu 
970 


Glu 


Ala 


Met 


Lys 


Ara 
975 


Leu 


Ser 


Tyr 


He 


Ser 
980 


Gin 


Lys 


Val 


Ser 


Asp 
985 


Ala 


Ser 


Asp 


Lys 


Thr 
990 


Gin 


Gin 


Ala 


Glu 


Arg 


Ala 


Leu 


Gly 


Ser 


Ala 


Ala 


Ala 


Asp 


Ala 


Gin 


Arg 


Ala 


Lys 






995 










1000 








1005 






Asn 


Gly 


Ala 


Gly 


Glu 


Ala 


Leu 


Glu 


He 


Ser 


Ser 


Glu 


He 


Glu 


Gin 


Glu 




1010 






1015 








1020 








He 


Gly Ser 


Leu 


Asn 


Leu 


Glu 


Ala 


Asn 


Val 


Thr 


Ala Asp Gly 


Ala 


Leu 


1025 








1030 








1035 








1040 


Ala 


Met 


Glu 


Lys 


Gly 


Leu 


Ala 


Ser 


Leu 


Lys 


Ser Glu Met Arg 


Glu 


Val 










1045 








1050 








1055 


Glu Gly Glu 


Leu 


Glu Arg Lys 


Glu 


Leu 


Glu 


Phe 


Asp 


Thr 


Asn 


Met Asp 








1060 








1065 








1070 




Ala 


Val 


Gin 


Met 


Val 


He 


Thr 


Glu 


Ala 


Gin 


Lys 


Val Asp 


Thr Arg Ala 



1075 1080 1085 



Lys Asn Ala Gly Val Thr He Gin Asp Thr Leu Asn Thr Leu Asp Gly 

1090 10'95 1100 

Leu Leu His Leu Met Asp Gin Pro Leu Ser Val Asp Glu Glu Gly Leu 
1105 1110 1115 1120 

Val Leu Leu Glu Gin Lys Leu Ser Arg Ala Lys Thr Gin He Asn Ser 

1125 * 1130 1135 

Gin Leu Arg Pro Met Met Ser Glu Leu Glu Glu Arg Ala Arg Gin Gin 

1140 1145 1150 

Arg Gly His Leu His Leu Leu Glu Thr Ser He Asp Gly He Leu Ala 

1155 1160 1165 

Asp Val Lys Asn Leu Glu Asn He Arg Asp Asn Leu Pro Pro Gly Cys 

1170 1175 " " 1180 

Tyr Asn Thr Gin Ala Leu Glu Gin Gin 
1185 1190 

<210> 15 

<211> 1111 

<212> PRT 

<213> Artificial Sequence 
<220> 

<C23> Description of Artificial Sequence/ Mote - 
synthetic construct 

<400> 15 

Met Pro Ala Leu Trp Leu Gly Cys Cys Leu Cys Phe Ser Leu Leu Leu 
15 10 15 
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Ala 


Aia 


Arg 


Ala 


Tnr 


Ser 


fly.- 

Arg 


Arg 


blU 


Val 


Cys 


ASp 


Cys 


Asn 


Gly 








9 n 










Z D 














Lys 




Arg 


ojLn 


cys 


lie 


Phe 


Asp 


Arg 


blU 


Leu 


His 


Arg 


Gin 


Thr 


Gly 






j j 




















a tr 
H J 






Asn 


Gly 


Pne 


Arg 


Cys 


Leu 


Asn 


Cys 


Asn 


Asp 


Asn 


Thr 


Asp 


Gly 


He 


His 




5U 




















bU 










Cys 


Glu 


Lys 


Cys 


Lys 


Asn 


Gly 


Phe 


Tyr 


Arg 


His 


Arg 


Glu 


Arg 


Asp 


Arg 


03 




















ID 








80 


Cys 


Leu 


Pro 


Cys 


Asn 


Cys 


Asn 


Ser 


Lys 


Gly 


Ser 


Leu 


Ser 


Ala 


Arg 


Cys 










o c 
0 D 










90 










95 


ASp 


Asn 


Ser 


Gly 


Arg 


Cys 


Ser 


Cys 


Lys 


Pro 


Gly 


Val 


Thr 


Gly 


Ala 


Arg 








inn 
1UU 










105 










110 






Cys 


Asp 


Arg 


Cys 


Leu 


Pro 


Gly 


Phe 


His 


Met 


Leu 


Thr 


Asp 


Ala 


Gly 


Cys 






115 










120 










125 








Thr 


Gin 


Asp 


Gin 


Arg 


Leu 


Leu 


Asp 


Ser 


Lys 


Cys 


Asp 


Cys 


Asp 


Pro 


Ala 




130 










135 










140 










Gly 


lie 


Ala 


Gly 


Pro 


Cys 


Asp 


Ala 


Gly 


Arg 


Cys 


Val 


Cys 


Lys 


Pro 


Ala 


145 










150 










155 










160 


Val 


Thr 


Gly 


Glu 


Arg 


Cys 


Asp 


Arg 


Cys 


Arg 


Ser 


Gly 


Tyr 


Tyr 


Asn 


Leu 










165 










170 










175 




Asp 


Gly 


Gly 


Asn 


Pro 


Glu 


Gly 


Cys 


Thr 


Gin 


Cys 


Phe 


Cys 


Tyr 


Gly 


His 








180 










185 










190 






Ser 


Ala 


Ser 


Cys 


Arg 


Ser 


Ser 


Ala 


Glu 


Tyr 


Ser 


Val 


His 


Lys 


lie 


Thr 






195 










200 










205 






Ser 


Thr 


Phe 


His 


Gin 


Asp 


Val 


Asp 


Gly 


Trp 


Lys 


Ala 


Val 


Gin 


Arg 


Asn 




<HU 










215 










220 










Gly 


Ser 


Pro 


Ala 


Lys 


Leu 


Gin 


Trp 


Ser 


Gin 


Arg 


His 


Gin 


Asp 


Val 


Phe 


225 










230 










235 










240 


Ser 


Ser 


Ala 


Gin 


Arg 


Leu 


Asp 


Pro 


Val 


Tyr 


Phe 


Val 


Ala 


Pro 


Ala 


Lys 










24 5 










250 










255 




Phe 


Leu 


Gly 


Asn 


Gin 


Gin 


Val 


Ser 


Tyr 


Gly 


Gin 


Ser 


Leu 


Ser 


Phe 


Asp 








260 










265 










270 




Tyr 


Arg 


Val 


Asp 


Arg 


Gly 


Gly 


Arg 


His 


Pro 


Ser 


Ala 


His 


Asp 


Val 


He 






275 










280 










285 








Leu 


Glu 


Gly 


Ala 


Gly 


Leu 


Arg 


He 


Thr 


Ala 


Pro 


Leu 


Met 


Pro 


Leu 


Gly 




290 










295 










300 








Lys 


Thr 


Leu 


Pro 


Cys 


Gly 


Leu 


Thr 


Lys 


Thr 


Tyr 


Thr 


Phe 


Arg 


Leu 


Asn 


305 










310 










315 








320 


Glu 


His 


Pro 


Ser 


Asn 


Asn 


Trp 


Ser 


Pro 


Gin 


Leu 


Ser 


Tyr 


Phe 


Glu 


Tyr 










325 










330 








335 


Arg 


Arg 


Leu 


Leu 


Arg 


Asn 


Leu 


Thr 


Ala 


Leu 


Arg 


He 


Arg 


Ala 


Thr 


Tyr 








340 










345 










350 






Gly 


Glu 


Tyr 


Ser 


Thr 


Gly 


Tyr 


He 


Asp 


Asn 


Val 


Thr 


Leu 


He 


Ser 


Ala 






n c c 

355 










360 










365 








Arg 


Pro 


Val 


Ser 


Gly 


Ala 


Pro 


Ala 


Pro 


Trp 


Val 


Glu 


Gin 


Cys 


lie 


Cys 




1 n r\ 

370 










375 










380 






Pro 


Val 


Gly 


Tyr 


Lys 


Gly 


Gin 


Phe 


Cys 


Gin 


Asp 


Cys 


Ala 


Ser 


Gly 


Tyr 


*5 a c 

385 










390 










395 










400 


Lys 


Arg 


Asp 


Ser 


Ala 


Arg 


Leu 


Gly 


Pro 


Phe 


Gly 


Thr 


Cys 


He 


Pro 


Cys 










405 










410 








415 


Asn 


Cys 


Gin 


Gly 


Gly 


Gly 


Ala 


Cys 


Asp 


Pro 


Asp 


Thr 


Gly 


Asp 


Cys 


Tyr 


















IOC 

4 25 










430 






Ser 


Gly 


Asp 


Glu 


Asn 


Pro 


Asp 


He 


Glu 


Cys 


Ala 


Asp 


Cys 


Pro 


He 


Gly 
















440 










445 






Pne 


Tyr 


7\ _ _ 

Asn 


Asp 


Pro 


His 


Asp 


Pro 


Arg 


Ser 


Cys 


Lys 


Pro 


Cys 


Pro 


Cys 




450 










455 










460 








His 


Asn 


Gly 


Phe 


Ser 


Cys 


Ser 


Val 


Met 


Pro 


Glu 


Thr 


Glu 


Glu 


Val 


Val 


4 65 










470 










475 










480 


Cys 


Asn 


Asn 


Cys 


Pro 


Pro 


Gly 


Val 


Thr 


Gly 


Ala 


Arg 


Cys 


Glu 


Leu 


Cys 










485 










490 






495 


Ala 


Asp 


Gly 


Tyr 


Phe 


Gly 


Asp 


Pro 


Phe 


Gly 


Glu 


His 


Gly 


Pro 


Val 


Arg 








500 










505 








510 
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Pro Cys Gin 
515 

Gly Asn Cys 

530 

Thr Ala Gly 
545 

Pro Leu Ala 

Pro Met Gly 

Cys Lys Pro 
595 

Cys Pro Ala 
610 

Gin Gin Leu 
625 

Asp Gly Val 

Glu Gin Ala 

. Ala Ser Arg 
675 

Asn Ser Tyr 

690 
Val Arg Ala 
705 

Arg Leu lie 

Leu Gly Asn 

Gly Phe Lys 
755 

Val Glu Ser 

770 
Tyr Ser Lys 
785 

Val Gly Ser 

Val Glu Lys 

Glu Ala Thr 
835 

Leu Arg Leu 
850 

Ser Phe Gin 
865 

Leu Ser Thr 

Lys Asn Leu 

Gly Lys Ser 
915 

Leu Ala Lys 

930 
Phe Tyr Glu 
945 

Gin Val Asp 

Ser Tyr lie 

Ala Glu Arg 
995 



Pro Cys Gin 

Asp Arg Leu 

lie Tyr Cys 
550 

Pro Asn Pro 

565 
Ser Glu Pro 
580 

Gly Phe Gly 

Cys Tyr Asn 

Gin Arg Met 
630 

Val Pro Asp 
64 5 

Leu Gin Asp 
660 

Ser Leu Gly 

Gin Ser Arg 

Leu Gly Ser 
710 

Thr Gin Met 
725 

Thr Asn lie 
740 

Ser Leu Ala 

Ala Ser Asn 

Gin Ala Leu 
790 

Gly Ser Gly 

805 
Leu Glu Lys 
820 

Gin Ala Glu 
Leu Asp Ser 



Cys Asn Asn Asn Val Asp 
520 

Thr Gly Arg Cys 
535 

Asp Gin Cys Lys 



Val Glu Glu 
870 

Leu Val Thr 

885 
Gly Asn Trp 
900 

Gly Arg Glu 



Ala Asp Lys Cys 
570 

Val Gly Cys Arg 
585 

Gly Pro Asn Cys 
600 

Gin Val Lys lie 
615 

Glu Ala Leu He 

Thr Glu Leu Glu 
650 

He Leu Arg Asp 
665 

Leu Gin Leu Ala 
680 

Leu Asp Asp Leu 
695 

Gin Tyr Gin Asn 

Gin Leu Ser Leu 
730 

Pro Ala Ser Asp 
745 

Gin Glu Ala Thr 
7 60 

Met Glu Gin Leu 
775 

Ser Leu Val Arg 

Ser Pro Asp Gly 
810 

Thr Lys Ser Leu 
825 

He Glu Ala Asp 
840 

Val Ser Arg Leu 
855 

Ala Lys Arg He 



Leu Lys 
540 
Ala Gly 
555 

Arg Ala 



Arg His Met Asp 
890 

Lys Glu Glu Ala 
905 

Lys Ser Asp Gin 
920 

Ser Arg Ala Gin Glu Ala Leu 
935 

He Leu Lys Asn 



Val Glu Ser 
950 

Asn Arg Lys 
965 

Ser Gin Lys 
980 

Ala Leu Gly 



Ala Glu Ala Glu 
970 

Val Ser Asp Ala 
985 

Ser Ala Ala Ala 
1000 



Ser Asp 

Glu His 

Gin Met 
620 
Ser Lys 
635 

Gly Arg 

Ala Gin 

Lys Val 

Lys Met 
700 
Arg Val 
715 

Ala Glu 

His Tyr 

Arg Leu 

Thr Arg 
780 
Lys Ala 
7 95 

Ala Val 

Ala Gin 

Arg Ser 

Gin Gly 
860 
Lys Gin 
875 

Glu Phe 

Gin Gin 

Leu Leu 

Ser Met 
940 
Leu Arg 
955 

Glu Ala 
Ser Asp 
Asp Ala 



Pro Ser 
525 

Cys He 

Tyr Phe 

Cys Asn 

Gly Thr 
590 
Gly Ala 
605 

Asp Gin 

Ala Gin 

Met Gin 

He Ser 
670 
Arg Ser 
685 

Thr Val 

Arg Asp 

Ser Glu 

Val Gly 
750 
Ala Glu 
765 

Glu Thr 

Leu His 

Val Gin 

Gin Leu 
830 
Tyr Gin 
845 

Val Ser 

Lys Ala 

Lys Arg 

Leu Leu 
910 
Ser Arg 
925 

Gly Asn 



Ala Ser 

His Asn 

Gly Asp 
560 
Cys Asn 
575 

Cys Val 

Phe Ser 

Phe Met 

Gly Gly 
640 
Gin Ala 
655 

Glu Gly 

Gin Glu 

Glu Arg 

Thr His 
720 
Ala Ser 
735 

Pro Asn 

Ser His 

Glu Asp 

Glu Gly 
800 
Gly Leu 
815 

Thr Arg 

His Ser 

Asp Gin 

Asp Ser 
880 
Thr Gin 
895 

Gin Asn 
Ala Asn 
Ala Thr 



Glu Phe 

Met Lys 

Lys Thr 
990 
Gin Arg 
1005 



Asp Leu 
960 
Arg Leu 
975 

Gin Gin 
Ala Lys 
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Asn Gly Ala Gly Glu Ala Leu Glu He Ser Ser Glu Tie Glu Gin Glu 

1010 1015 1020 

He Gly Ser Leu Asn Leu Glu Ala Asn Val Thr Ala Asp Gly Ala Leu 
1025 1030 1035 * 1040 

Ala Met Glu Lys Gly Leu Ala Ser Leu Lys Ser Glu Met Arg Glu Val 

1045 1050 1055 

Glu Gly Glu Leu Glu Arg Lys Glu Leu Glu Phe Asp Thr Asn Met Asp 

1060 1065 1070 

Ala Val Gin Met Val He Thr Glu Ala Gin Lys Val Asp Thr Arg Ala 

1075 1080 1085 

Lys Asn Ala Gly Val Thr He Gin Asp Thr Leu Asn Thr Leu Asp Gly 

1090 1095 1100 

Leu Leu His Leu Met Gly Met 
1105 1110 

<210> 16 
<211> 5622 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note « 
synthetic construct 

<400> 16 

atggcagcgc cacgccccag cccatgggcc aggctgctcc tggcagcctt gatcagcgtc 60 

agcctctctg ggaccttgaa ccgctgcaag aaggccccag tgaagagctg cacggagtgt 120 

gtccgtgtgg ataaggactg cgcctactgc acagacgaga tgttcaggga ccggcgctgc 180 

aacacccagg cggagctgct ggccgcgggc tgccagcggg agagcatcgt ggtcatggag 240 

agcagcttcc aaatcacaga ggagacccag attgacacca ccctgcggcg cagccagatg 300 

tccccccaag gcctgcgggt ccgtctgcgg cccggtgagg agcggcattt tgagctggag 360 

gtgtttgagc cactggagag ccccgtggac ctgtacatcc tcatggactt ctccaactcc 420 

atgtccgatg atctggacaa cctcaagaag atggggcaga acctggctcg ggtcctgagc 430 

cagctcacca gcgactacac tattggattt ggcaagtttg tggacaaagt cagcgtcccg 540 

cagacggaca tgaggcctga gaagctgaag gagccttggc ccaacagtga cccccccttc 600 

tccttcaaga acgtcatcag cct'gacagaa gatgtggatg agttccggaa taaactgcag 660 

ggagagcgga tctcaggcaa cctggatgct cctgagggcg gcttcgatgc catcctgcag 720 

acagctgtgt gcacgaggga cattggctgg cgcccggaca gcacccacct gctggtcttc 7 80 

tccaccgagt cagccttcca ctatgaggct gatggcgcca acgtgctggc tggcatcatg 340 

agccgcaacg atgaacggtg ccacctggac accacgggca cctacaccca gtacaggaca 900 

caggactacc cgtcggtgcc caccctggtg cgcctgctcg ccaagcacaa catcatcccc 960 

atctttgctg tcaccaacta ctcctatagc tactacgaga agcttcacac ctatttccct 1020 

gtctcctcac tgggggtgct gcaggaggac tcgtccaaca tcgtggagct gctggaggag 1080 

gccttcaatc ggatccgctc caacctggac atccgggccc tagacagccc ccgaggcctt 1140 

cggacagagg tcacctccaa gatgfctccag aagacgagga ctgggtcctt tcacatccgg 1200 

cggggggaag tgggtatata ccaggtgcag ctgcgggccc ttgagcacgt ggatgggacg 1260 

cacgtgtgcc agctgccgga ggaccagaag ggcaacatcc atctgaaacc ttccttctcc 1320 

gacggcctca agatggacgc gggcatcatc tgtgatgtgt gcacctgcga gctgcaaaaa 1380 

gaggtgcggt cagctcgctg cagcttcaac ggagacttcg tgtgcggaca gtgtgtgtgc 14 4 0 

agcgagggct ggagtggcca gacctgcaac tgctccaccg gctctctgag tgacattcag 1500 

ccctgcctgc gggagggcga ggacaagccg tgctccggcc gtggggagtg ccagtgcggg 1560 

cactgtgtgt gctacggcga aggccgctac gagggtcagt tctgcgagta tgacaacttc 1620 

cagtgtcccc gcacttccgg gttcctctgc aatgaccgag gacgctgctc catgggccag 1630 

tgtgtgtgtg agcctggttg gacaggccca agctgtgact gtcccctcag caatgccacc 1740 

tgcatcgaca gcaatggggg catctgtaat ggacgtggcc acfcgtgagtg tggccgctgc 1800 

cactgccacc agcagtcgct ctacacggac accatctgcg agatcaacta ctcggcgtcc I860 

acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg ggcaccggcg 1920 

agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg gacgagctta 1980 

agagaggcga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac gactgcacct 2040 

acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc ctggtgcaca 2100 

agaagaaggg actgccctcc gggctccttc -ggtggctca tccccctgct cctcctcctc 2160 

ctgccgctcc tggccctgct actgctgcta tgctggaagt actgtgcctg ctgcaaggcc 2220 
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tgcctggcac tfcctcccgtg ctgcaaccga ggtcacatgg tgggctttaa ggaagaccac 2280 

tacafcgctgc gggagaacct gatggcctct gaccacttgg acacgcccat gctgcgcagc 2340 

gggaacctca agggccgtga cgtggtccgc fcggaaggtca ccaacaacat gcagcggccfc 2400 

ggctttgcca ctcatgccgc cagcatcaac cccacagagc tggtgcccta cgggctgfccc 2460 

fctgcgcctgg cccgcctttg caccgagaac ctgctgaagc ctgacactcg ggagtgcgcc 2520 

cagctgcgcc aggaggtgga ggagaaccfcg aacgaggtct acaggcagat ctccggtgta 2580 

cacaagctcc agcagaccaa gttccggcag cagcccaafcg ccgggaaaaa gcaagaccac 2640 

accattgtgg acacagtgct gatggcgccc cgctcggcca agccggccct gctgaagctt 2700 

acagagaagc aggtggaaca gagggccttc cacgacctca aggtggcccc cggcfcactac 2760 

accctcactg cagaccagga cgcccggggc afcggtggagt tccaggaggg cgtggagcfcg 2820 

gtggacgtac gggtgcccct ctttatccgg cctgaggatg acgacgagaa gcagctgctg 2890 

gtggaggcca tcgacgtgcc cgcaggcact gccaccctcg gccgccgcct ggfcaaacatc 2940 

accatcatca aggagcaagc cagagacgtg gtgtcctttg agcagcctga gttctcggtc 3000 

agccgcgggg accaggtggc ccgcatccct gtcatccggc gtgtcctgga cggcgggaag 3060 

tcccaggtct cctaccgcac acaggatggc accgcgcagg gcaaccggga ctacatcccc 3120 

gtggagggtg agctgctgtt ccagcctggg gaggcctgga aagagctgca ggtgaagctc 3180 

ctggagctgc aagaagttga ctccctcctg cggggccgcc aggtccgccg tttccacgtc 3240 

cagctcagca accctaagtt tggggcccac ctgggccagc cccactccac caccatcatc 3300 

atcagggacc cagatgaact ggaccggagc ttcacgagtc agatgttgtc atcacagcca 3360 

ccccctcacg gcgacctggg cgccccgcag aaccccaatg ctaaggccgc tgggtccagg 3420 

aagatccatt tcaactggct gcccccttct ggcaagccaa tggggtacag ggtaaagtac 3480 

tggattcagg gtgactccga atccgaagcc cacctgctcg acagcaaggt gccctcagtg 3540 

gagctcacca acctgtaccc gtattgcgac tatgagatga aggtgtgcgc ctacggggct 3600 

cagggcgagg gaccctacag ctccctggtg tcctgccgca cccaccagga agtgcccagc 3660 

gagccagggc gtctggcctt caatgtcgtc tcctccacgg tgacccagct gagctgggct 3720 

gagccggctg agaccaacgg tgagatcaca gcctacgagg tctgctatgg cctggfccaac 3780 

gatgacaacc gacctattgg gcccatgaag aaagtgctgg ttgacaaccc taagaaccgg 3840 

atgctgctta ttgagaacct tcgggagtcc cagccctacc gctacacggb gaaggcgcgc 3900 

aacggggccg gctgggggcc tgagcgggag gccatcatca acctggccac ccagcccaag 3960 

aggcccatgt ccatccccat catccctgac atccctatcg tggacgccca gagcggggag 4020 

gactacgaca gcttccttat gtacagcgat gacgttctac gctctccatc gggcagccag 4080 

aggcccagcg tctccgatga cactggctgc ggctggaagt tcgagccccfc gctgggggag 4140 

gagctggacc tgcggcgcgt cacgtggcgg ctgcccccgg agctcatccc gcgcctgtcg 4200 

gccagcagcg ggcgctcctc cgacgccgag gcgccccacg ggcccccgga cgacggcggc 4260 
gcgggcggga agggcggcag cctgccccgc agtgcgacac ccgggccccc cggagagcac . 4 320 

ctggtgaatg gccggatgga ctttgccttc ccgggcagca ccaactccct gcacaggatg 4 380 

accacgacca gtgctgctgc ctatggcacc cacctgagcc cacacgtgcc ccaccgcgtg 4 4 40 

ctaagcacat cctccaccct cacacgggac tacaactcac tgacccgctc agaacactca 4 500 

cactcgacca cactgccgag ggactactcc accctcacct ccgtctcctc ccacggcctc 4560 

cctcccatct gggaacacgg gaggagcagg cttccgctgt cctgggccct ggggtcccgg 4 620 

agbcgggctc agatgaaagg gttcccccct tccaggggcc cacgagactc tataatcctg 4 680 

gctgggaggc cagcagcgcc ctcctggggc ccagactctc gcctgactgc tggtgtgccc 4 740 

gacacgccca cccgcctggt gttctctgcc ctggggccca catctctcag agtgagctgg 4800 

caggagccgc ggtgcgagcg gccgctgcag ggctacagtg tggagtacca gctgctgaac 4860 

ggcggtgagc bgcatcggct caacatcccc aaccctgccc agacctcggt ggtggtggaa 4 920 

gacctcctgc ccaaccactc ctacgtgttc cgcgtgcggg cccagagcca ggaaggctgg 4 980 

ggccgagagc gtgagggtgt catcaccatt gaatcccagg tgcacccgca gagcccactg 5040 

tgtcccctgc caggcfcccgc cttcactfctg agcactccca gtgccccagg cccgctggtg 5100 

ttcactgccc tgagcccaga ctcgctgcag ctgagctggg agcggccacg gaggcccaat 5160 

ggggatatcg tcggctacct ggtgacctgt gagatggccc aaggaggagg tccagccacc 5220 

gcatcccggg fcggatggaga cagccccgag agccggctga ccgtgccggg cctcagcgag 5280 

aacgcgccct acaagttcaa ggtgcaggcc aggaccactg agggcttcgg gccagagcgc 5340 

gagggcatca tcaccataga gtcccaggafc ggaggtccct tcccgcagct gggcagccgt 5400 

gccgggcfcct tccagcaccc gctgcaaagc gagtacagca gcatctccac cacccacacc 5460 

agcgccaccg agcccfctcct agtgggcccg accctggggg cccagcacct ggaggcaggc 5520 

ggctccctca cccggcatgt gacccaggag fcttgtgagcc ggacactgac caccagcgga 5530 

acccttagca cccacatgga ccaacagttc ttccaaacfct ga ~ ^ 5622 



<210> 17 
<211> 1873 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 17 

Met Ala Gly Pro Arg Pro Ser Pro Trp Ala Arg Leu Leu Leu Ala Ala 

1 " 5 10 15 

Leu He Ser Val Ser Leu Ser Gly Thr Leu Asn Arg Cys Lys Lys Ala 

20 25 30 

Pro He Lys Ser Gys Thr Glu Cys Val Arg Val Asp Lys Asp Cys Ala 

35 40 45 

Tyr Cys Thr Asp Glu Met Phe Arg Asp Arg Arg Cys Asn Thr Gin Ala 

50 55 60 

Glu Leu Leu Ala Ala Gly Cys Gin Arg Glu Ser He Val Val Met Glu 
65 70 75 80 

Ser Ser Phe Gin He Thr Glu Glu Thr Gin He Asp Thr Thr Leu Arg 

85 90 95 

Arg Ser Gin Met Ser Pro Gin Gly Leu Arg Val Arg Leu Arg Pro Gly 

100 105 110 

Glu Glu Arg His Phe Glu Leu Glu Val Phe Glu Pro Leu Glu Ser Pro 

115 120 125 

Val Asp Leu Tyr He Leu Met Asp Phe Ser Asn Ser Met Ser Asp Asp 

130 135 140 

Leu Asp Asn Leu Lys Lys Met Gly Gin Asn Leu Ala Arg Val Leu Ser 
145 ' 150 155 160 

Gin Leu Thr Ser Asp Tyr Thr lie Gly Phe Gly Lys Phe Val Asp Lys 

165 170 175 

Val Ser Val Pro Gin Thr Asp Met Arg Pro Glu Lys Leu Lys Glu Pro 

180 185 190 

Trp Pro Asn Ser Asp Pro Pro Phe Ser Phe Lys Asn Val He Ser Leu 

195 * 200 205 

Thr Glu Asp Val Asp Glu Pl>e Arg Asn Lys Leu Gin Gly Glu Arg He 

210 215 220 

Ser Gly Asn Leu Asp Ala Pro Glu Gly Gly Phe Asp Ala He Leu Gin 
225 230 235 240 

Thr Ala Val Cys Thr Arg Asp He Gly Trp Arg Pro Asp Ser Thr His 

245 250 255 

Leu Leu Val Phe Ser Thr Glu Ser Ala Phe His Tyr Glu Ala Asp Gly 

260 265 270 

Ala Asn Val Leu Ala Gly He Met Ser Arg Asn Asp Glu Arg Cys His 

275 280 285 

Leu Asp Thr Thr Gly Thr Tyr Thr Gin Tyr Arg Thr Gin Asp Tyr Pro 

290 295 300 

Ser Val Pro Thr Leu Val Arg Leu Leu Ala Lys His Asn He lie Pro 
305 310 315 320 

He Phe Ala Val Thr Asn Tyr Ser Tyr Ser Tyr Tyr Glu Lys Leu His 

325 330 335 

Thr Tyr Phe Pro Val Ser Ser Leu Gly Val Leu Gin Glu Asp Ser Ser 

340 345 350 

Asn He Val Glu Leu Leu Glu Glu Ala Phe Asn Arg He Arg Ser Asn 

355 360 365 

Leu Asp He Arg Ala Leu Asp Ser Pro Arg Gly Leu Arg Thr Glu Val 

370 375 " 380 

Thr Ser Lys Met Phe Gin Lys Thr Arg Thr Gly Ser Phe His He Arg 
385 390 395 400 

Arg Gly Glu Val Gly He Tyr Gin Val Gin Leu Arg Ala Leu Glu His 

405 410 415 

Val Asp Gly Thr His Val Cys Gin Leu. Pro Glu Asp Gin Lys Gly Asn 
420 425 430 
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He 


His 


Leu 


Lys 






435 




He 


He 


Cys 


Asp 




450 






Ala 


Ara; 


Cys 


Ser 


465 








Ser 


Glu 


Gly 


Trp 


Ser 


Asp 


lie 


Gin 








500 


Gly 


Arg 


Gly 


Glu 






515 




Arg 


Tur 


Glu 


Gly 




530 






Thr 


Ser 

OCX 


Gly 


Phe 


545 








Cvs 


Val 


u y 3 


Glu 


Ser 


Asn 


Ala 


Thr 








580 


Gly 


His 


Cys 


Glu 






595 




Thr 


Asp 


Thr 


He 




610 






Ala 


Arg 


Thr 


x yx 


625 








Arg 


Arg 


Arg 


Glv 

oxy 


Trn 


Thr 


Ser 


Leu 








660 


Gly 


Thr 


Arg 


Met 






675 




Ala 


Pro 


Leu 


Glv 
oxy 




690 






Cys 


Pro 


Pro 


Glv 


7D5 








Leu 


Pro 


XI C u 


Leu 


Cys 


^ y a 


Ti vs 


AT ^ 








740 


Met 


Val 


Gly 


Phe 






755 




Ala 


Ser 


A so 


H-i s 
n x o 




770 






Glv 


Arn 


A so 


Va 1 


785 








Gl v 


Phe 


Ala 


Thr 


T vr 
j, yx 


Glv 
oxy 


Leu 










820 


Lys 


Pro 


Asp 


Thr 






835 




Asn 


Leu 


Asn 


Glu 




850 






Gin 


Thr 


Lys 


Phe 


865 








Thr 


He 


Val 


Asp 


Leu 


Leu 


Lys 


Leu 








900 


Leu 


Lys 


Val 


Ala 



915 



Pro 


Ser 


Phe 


Ser 








440 


Val 


Cys 


Thr 


Cys 






455 




Phe 


Asn 


Gly 


Asp 




470 






Ser 


Glv 


Gin 


Thr 


485 








Pro 


Cvs 


Leu 


Ara 


Cvs 

-y o 


Gin 


Cys 


Glv 

oxy 








520 


Gin 


Phe 


Cys 


Glu 






535 




Leu 


Cvs 

\-y a 


Asn 


Asp 




550 






Pro 


Glv 




Thr 


565 








Cys 


He 


Asp 


Ser 


Cys 


Glv 

oxy 


Arg 


Cys 








600 


Cys 


Glu 


He 


Asn 






615 




Ala 


Pro 


Ala 


Cys 




630 






Ala 


Arg 


Val 


Arg 


645 








Arg 


Glu 


Ala 


Arg 


Thr 


Thr 


Ala 


Pro 








680 


Pro 


Thr 


Ala 


Leu 






695 




Ser 


Phe 








710 




Ala 


Leu 


Leu 


XI U 


725 










TiOii 

xitr u 


AT a 
Ala 




Lys 


Glu 


A so 


H-i c 








760 


Leu 


Asp 


Thr 


Pro 






775 




Val 

v ax 


Aro 




jjy o 




790 






His 


Ala 


Ala 


OCX 


805 








Leu 


Arn 
**x y 


XJC Li 


A") a 


Arg 


Glu 


Cvs 
v~y a 


Ala 








840 


Val 


Tyr 


Arg 


Gin 






855 




Arg 


Gin 


Gin 

OX J 1 


P m 




870 






Thr 


Val 


Leu 


Met 


885 








Thr 


Glu 


Lys 


Gin 


Pro 


Gly 


Tyr 


Tyr 








920 



34 



Asp 


Gly 


Leu 


Lvs 


Glu 


Leu 


Gin 


Lvs 








4 60 


Phe 


Val 


Cys 


Gly 






475 




Cys 


Asn 


Cvs 


Ser 




490 






Glu 


Gly 


Glu 


Asp 


505 








His 


Cys 


Val 


Cys 


T vr 

x yx 


Asp 


Asn 


Phe 








540 


Arg 


Gly 


Arg 


Cys 






555 




Glv 


Pro 


Ser 


Cys 




570 






Asn 


Glv 
oxy 


Glv 

vjxy 


Tie 

lie 


585 








His 


Cys 


His 


Gin 


T vr 

xyx 


Ser 


Ala 


Ser 








620 


Ser 


Ala 


Arg 


Arg 






635 




Asn 


Ala 


Thr 


Ser 




650 






Arg 


Tro 

Br 


TrD 


Cvs 


665 








Thr 


Ala 


Thr 


Pro 


Ser 


Tro 

XT 


Cys 


Thr 








700 


Leu 


He 


Pro 


Leu 






715 




Leu 


Leu 


Cys 


Tro 




730 






Leu 


Pro 


Cys 


Cvs 
v> y o 


745 








T vr 
xy x 


Met 


Leu 


Arg 


Met 


Leu 


Arg 


Ser 








780 


Val 


Thr 


Asn 


Asn 






795 




He 


Asn 


Pro 


Thr 




810 






Arg 


Leu 


Cys 


Thr 

X J IX 


825 








Gin 


Leu 


Arg 


Gin 


He 


Ser 


Glv 

oxy 


Val 








860 


Asn 


Ala 


Gly 


Lys 






875 




Ala 


Pro 


Arg 


Ser 




890 






Val 


Glu 


Gin 


Arg 


905 








Thr 


Leu 


Thr 


Ala 



Met 


Asp 


Ala 


Gly 


445 








Glu 


Val 


Aro 


Ser 


Gin 


Cys 


Val 


Cys 








480 


Thr 


Gly 


Ser 


Leu 






495 




Lvs 


Pro 


Cvs 


Ser 




510 






T vr 


Glv 

oxy 


Glu 


Gly 


525 








Gin 


Cys 


Pro 


Arg 


Ser 


Met 


Gly 


Gin 








560 


Asp 


Cys 


Pro 


Leu 






575 




\^y o 


Asn 


Glv 
oxy 


A TTT 

m y 




590 






Gin 


Ser 


Leu 


T vr 
iyx 


605 








Thr 


Arn 
ax y 


Ala 


^^r 


Glv 

oxy 


Ala 


Pro 

t X o 


Ala 








640 


Arg 


Ser 


Arg 


Trp 






655 




Ala 


Ala 


Pro 


Ser 




670 






TrD 


Lys 


Val 


Thr 


685 








Arg 


Ax*g 


Arg 


Asp 


XI 55 U 


Leu 


u ~ u 










720 


Lys 


Tyr 


Cys 


Ala 






735 




Asn 


Arg 


Glv 

oxy 


His 




7 50 

/ J V 






ulu 


Asn 




Mpf 


7 65 








gi v 

oxy 


Ben 


T .on 


- 

Lys 


Lie i- 


Gl n 


ttx y 


Prn 

cJCU 








300 


Glu 


Leu 


Val 


Pro 






815 




Glu 


Asn 


Leu 






830 






Glu 


Val 


Gin 
ox u 


Gl n 

UlU 


845 








His 


Lvs 

xiy a 


Leu 


Gin 


Lys 


Gin 


Asp 


His 








880 


Ala 


Lys 


Pro 


Ala 






895 




Ala 


Phe 


His 


Asp 




910 




Asp 


Gin 


Asp 


Ala 
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Arg Gly Met Val Glu Phe Gin Glu Gly Val Glu Leu Val Asp Val Arg 

930 935 940 

Val Pro Leu Phe He Arg Pro Glu Asp Asp Asp Glu Lys Gin Leu Leu 
945 950 955 ' 960 

Val Glu Ala He Asp Val Pro Ala Gly Thr Ala Thr Leu Gly Arg Arg 

965 970 975 

Leu Val Asn He Thr He He Lys Glu Gin Ala Arg Asd Val Val Ser 

980 985 990 

Phe Glu Gin Pro Glu Phe Ser Val Ser Arg Gly Asd Gin Val Ala Arg 

995 1000 * 1005 

He Pro Val He Arg Arg Val Leu Asp Gly Gly Lys Ser Gin Val Ser 

1010 1015 " 1020 

Tyr Arg Thr Gin Asp Gly Thr Ala Gin Gly Asn Arg Asp Tyr He Pro 
1025 1030 1035 1040 

Val Glu Gly Glu Leu Leu Phe Gin Pro Gly Glu Ala Trp Lys Glu Leu 

1045 1050 * " 1055 

Gin Val Lys Leu Leu Glu Leu Gin Glu Val Asp Ser Leu Leu Arg Gly 

1060 1065 * 1070 

Arg Gin Val Arg Arg Phe His Val Gin Leu Ser Asn Pro Lys Phe Gly 

1075 1080 1085 

Ala His Leu Gly Gin Pro His Ser Thr Thr He He He Arg Asp Pro 

1090 1095 1100 

Asp Glu Leu Asp Arg Ser Phe Thr Ser Gin Met Leu Ser Ser Gin Pro 
1105 1110 1115 1120 

Pro Pro His Gly Asp Leu Gly Ala Pro Gin Asn Pro Asn Ala Lys Ala 

1125 1130 1135 

Ala Gly Ser Arg Lys He His Phe Asn Trp Leu Pro Pro Ser Gly Lys 

1140 1145 1150 

Pro Met Gly Tyr Arg Val Lys Tyr Trp He Gin Gly Asp Ser Glu Ser 

1155 1160 1165 

Glu Ala His Leu Leu Asp Ser Lys Val Pro Ser Val Glu Leu Thr Asn 

1170 1175 * 1180 

Leu Tyr Pro Tyr Cys Asp Tyr Glu Met- Lys Val Cys Ala Tyr Gly Ala 
1185 1190 1195 ~ 1200 

Gin Gly Glu Gly Pro Tyr Ser Ser Leu Val Ser Cys Arg Thr His Gin 

1205 1210 " 1215 

Glu Val Pro Ser Glu Pro Gly Arg Leu Ala Phe Asn Val Val Ser Ser 

1220 1225 1230 

Thr Val Thr Gin Leu Ser Trp Ala Glu Pro Ala Glu Thr Asn Gly Glu 

1235 1240 1245 

He Thr Ala Tyr Glu Val Cys Tyr Gly Leu Val Asn Asp Asp Asn Arg 

1250 1255 1260 

Pro He Gly Pro Met Lys Lys Val Leu Val Asp Asn Pro Lys Asn Arg 
1265 1270 1275 1280 

Met Leu Leu He Glu Asn Leu Arg Glu Ser Gin Pro Tyr Arg Tyr Thr 

1285 1290 ~ 1295 

Val Lys Ala Arg Asn Gly Ala Gly Trp Gly Pro Glu Arg Glu Ala He 

1300 1305 1310 

He Asn Leu Ala Thr Gin Pro Lys Arg Pro Met Ser He Pro He He 

1315 1320 1325 

Pro Asp He Pro He Val Asp Ala Gin Ser Gly Glu Asp Tyr Asp Ser 

1330 1335 1340 

Phe Leu Met Tyr Ser Asp Asp Val Leu Arg Ser Pro Ser Gly Ser Gin 
•345 1350 1355 1360 

Arg Pro Ser Val Ser Asp Asp Thr Gly Cys Gly Trp Lys Phe Glu Pro 

1365 * 1370 1375 

Leu Leu Gly Glu Glu Leu Asp Leu Arg Arg Val Thr Trp Arg Leu Pro 

1380 1385 - 1390 

Pro Glu Leu He Pro Arg Leu Ser Ala Ser Ser Gly Arg Ser Ser Asp 

1395 1400 1405 

Ala Glu Ala Pro His Gly Pro Pro Asp Asp Gly Gly Ala Gly Gly Lys 
1410 1415 1420 
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Gly Gly Ser Leu Pro Arg Ser Ala Thr Pro Gly Pro Pro Gly Glu His 
1425 1430 1435 1440 

Leu Val Asn Gly Arg Met Asp Phe Ala Phe Pro Gly Ser Thr Asn Ser 

1445 1450 1455 

Leu His Arg Met Thr Thr Thr Ser Ala Ala Ala Tyr Gly Thr His Leu 

1460 1465 1470 

Ser Pro His Val Pro His Arg Val Leu Ser Thr Ser Ser Thr Leu Thr 

1475 1480 1485 

Arg Asp Tyr Asn Ser Leu Thr Arg Ser Glu His Ser His Ser Thr Thr 

1490 1495 1500 

Leu Pro Arg Asp Tyr Ser Thr Leu Thr Ser Val Ser Ser His Gly Leu 
1505 ** 1510 1515 1520 

Pro Pro lie Trp Glu His Gly Arg Ser Arg Leu Pro Leu Ser Trp Ala 

1525 1530 1535 

Leu Gly Ser Arg Ser Arg Ala Gin Met Lys Gly Phe Pro Pro Ser Arg 

1540 1545 " 1550 

Gly Pro Arg Asp Ser He He Leu Ala Gly Arg Pro Ala Ala Pro Ser 

1555 1560 1565 

Trp Gly Pro Asp Ser Arg Leu Thr Ala Gly Val Pro Asp Thr Pro Thr 

1570 1575 * 1580 

Arg Leu Val Phe Ser Ala Leu Gly Pro Thr Ser Leu Arg Val Ser Trp 
1585 1590 1595 1600 

Gin Glu Pro Arg Cys Glu Arg Pro Leu Gin Gly Tyr Ser Val Glu Tyr 

1605 1610 1615 

Gin Leu Leu Asn Gly Gly Glu Leu His Arg Leu Asn He Pro Asn Pro 

1620 " " 1625 1630 

Ala Gin Thr Ser Val Val Val Glu Asp Leu Leu Pro Asn His Ser Tyr 

1635 1640 1645 

Val Phe Arg Val Arg Ala Gin Ser Gin Glu Gly Tro Gly Arg Glu Arg 

1650 1655 1660 

Glu Gly Val He Thr He Glu Ser Gin Val His Pro Gin Ser Pro Leu 
1665 1670 1675 1680 

Cys Pro Leu Pro Gly Ser Ala Phe Thr Leu Ser Thr Pro Ser Ala Pro 

1685 1690 1695 

Gly Fro Leu Val Phe Thr Ala Leu Ser Pro Asp Ser Leu Gin Leu Ser 

1700 '.' 1705 1710 

Trp Glu Arg Pro Arg Arg Pro Asn Gly Asp He Val Gly Tyr Leu Val 

1715 1720 1725 

Thr Cys Glu Met Ala Gin Gly Gly Gly Pro Ala Thr Ala Phe Arg Val 

1730 1735 1740 

Asp Gly Asp Ser Pro Glu Ser Arg Leu Thr Val Pro Gly Leu Ser Glu 
1745 1750 1755 1760 

Asn Val Pro Tyr Lys Phe Lys Val Gin Ala Arg Thr Thr Glu Gly Phe 

1765 1770 1775 

Gly Pro Glu Arg Glu Gly He He Thr He Glu Ser Gin Asp Gly Gly 

1780 1785 1790 

Pro Phe Pro Gin Leu Gly Ser Arg Ala Gly Leu Phe Gin His Pro Leu 

1795 1800 ~ 1805 

Gin Ser Glu Tyr Ser Ser He Ser Thr Thr His Thr Ser Ala Thr Glu 

1810 1815 1820 

Pro Phe Leu Val Gly Pro Thr Leu Gly Ala Gin His Leu Glu Ala Gly 
1825 1830 1835 1840 

Gly Ser Leu Thr Arg His Val Thr Gin Glu Phe Val Ser Arg Thr Leu 

1845 1850 1855 

Thr Thr Ser Gly Thr Leu Ser Thr His Met Asp Gin Gin Phe Phe Gin 
1860 1865 1870 

Thr 



<210> 18 
<211> 5622 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<40Q> 18 

atggcagggc cacgccccag cccatgggcc aggctgctcc tggcagcctt gatcagcgtc 60 

agcctctctg ggaccttgaa ccgctgcaag aaggccccaa taaagagctg cacggagtgt 120 

gtccgtgtgg ataaggactg cgcctactgc acagacgaga tgttcaggga ccggcgctgc 180 

aacacccagg cggagctgct ggccgcgggc tgccagcggg agagcatcgt ggtcatggag 240 

agcagcttcc aaatcacaga ggagacccag attgacacca ccctgcggcg cagccagatg 300 

tccccccaag gcctgcgggt ccgtctgcgg cccggtgagg agcggcattt tgagctggag 360 

gtgtttgagc cactggagag ccccgtggac ctgtacafccc tcatggactt ctccaactcc 420 

atgtccgatg atctggacaa cctcaagaag atggggcaga acctggctcg ggtcctgagc 4 80 

cagctcacca gcgactacac tattggattt ggcaagtttg tggacaaagt cagcgtcccg 540 

cagacggaca tgaggcctga gaagctgaag gagccttggc ccaacagtga cccccccttc 600 

tccttcaaga acgtcatcag cctgacagaa gatgtggatg agttccggaa taaactgcag 660 

ggagagcgga tctcaggcaa cctggatgct cctgagggcg gcttcgatgc catcctgcag 720 

acagctgtgt gcacgaggga cattggctgg cgcccggaca gcacccacct gctggtcttc 780 

tccaccgagt cagccttcca ctatgaggct gatggcgcca acgtgctggc tggcatcatg 840 

agccgcaacg atgaacggtg ccacctggac accacgggca cctacaccca gtacaggaca 900 

caggactacc cgtcggtgcc caccctggtg cgcctgctcg ccaagcacaa catcatcccc 960 

atctttgctg tcaccaacta ctcctatagc tactacgaga agcttcacac ctatttccct 1020 

gtctcctcac tgggggtgct gcaggaggac tcgtccaaca tcgtggagct gctggaggag 1080 

gccttcaatc ggatccgctc caacctggac atccgggccc tagacagccc ccgaggcctt 1140 

cggacagagg tcacctccaa gatgttccag aagacgagga ctgggtcctt tcacatccgg 1200 

cggggggaag tgggtatata ccaggtgcag ctgcgggccc ttgagcacgt ggatgggacg 1260 

cacgtgtgcc agctgccgga ggaccagaag ggcaacatcc atctgaaacc ttccttctcc 1320 

gacggcctca agatggacgc gggcatcatc tgtgatgtgt gcacctgcga gctgcaaaaa 1380 

gaggtgcggt cagctcgctg cagcttcaac ggagacttcg tgtgcggaca gtgtgtgtgc 1440 

agcgagggct ggagtggcca gacctgcaac tgctccaccg gctctctgag tgacattcag 1500 

ccctgcctgc gggagggcga ggacaagccg tgctccggcc gtggggagtg ccagtgcggg 1560 

cactgtgtgt gctacggcga aggccgctac gagggtcagt tctgcgagta tgacaacttc 1620 

cagtgtcccc gcacttccgg gttcctctgc aatgaccgag gacgctgctc catgggccag 1680 

tgtgtgtgtg agcctggttg gacaggccca agctgtgact gtcccctcag caatgccacc 1740 

tgcatcgaca gcaatggggg catctgtaat ggacgtggcc actgtgagtg tggccgctgc 1800 

cactgccacc agcagtcgct ctacacggac accatctgcg agatcaacta ctcggcgtcc 1860 

acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg ggcaccggcg 1920 

agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg gacgagctta 1980 

agagaggcga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac gactgcacct 2040 

acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc ctggtgcaca 2100 

agaagaaggg actgccctcc gggctccttc tggtggctca tccccctgct cctcctcctc 2160 

ctgccgctcc tggccctgct actgctgcta tgctggaagt actgtgcctg ctgcaaggcc 2220 

tgcctggcac ttctcccgtg ctgcaaccga ggtcacatgg tgggctttaa ggaagaccac 2280 

tacatgctgc gggagaacct gatggcctct gaccacttgg acacgcccat gctgcgcagc 2340 

gggaacctca agggccgtga cgtggtccgc tggaaggtca ccaacaacat gcagcggcct 2400 

ggctttgcca ctcatgccgc cagcatcaac cccacagagc tggtgcccta cgggctgtcc 24 60 

ttgcgcctgg cccgcctttg caccgagaac ctgctgaagc ctgacactcg ggagtgcgcc 2520 

cagctgcgcc aggaggtgga ggagaacctg aacgaggtct acaggcagat ctccggtgta 2580 

cacaagctcc agcagaccaa gttccggcag cagcccaatg ccgggaaaaa gcaagaccac 2640 

accattgtgg acacagtgct gatggcgccc cgctcggcca agccggccct gctgaagctt 2700 

acagagaagc aggtggaaca gagggccttc cacgacctca aggtggcccc cggctactac 2760 

accctcacbg cagaccagga cgcccggggc atggtggagt tccaggaggg cgtggagctg 2820 

gtggacgtac gggtgcccct ctttatccgg cctgaggatg acgacgagaa gcagctgctg 2880 

gtggaggcca tcgacgtgcc cgcaggcact gccaccctcg gccgccgcct ggtaaacatc 2940 

accatcatca aggagcaagc cagagacgtg gtgtcctttg agcagcctga gttctcggtc 3000 

agccgcgggg accaggtggc ccgcatccct gtcatccggc gtgtcctgga cggcgggaag 3060 

tcccaggtct cctaccgcac acaggatggc accgcgcagg gcaaccggga ctacatcccc 3120 

gtggagggtg agctgctgtt ccagcctggg gaggcctgga aagagctgca ggtgaagctc 3180 

ctggagctgc aagaagttga ctccctcctg cggggccgcc aggtccgccg tttccacgtc 3240 

cagctcagca accctaagtt tggggcccac ctgggccagc cccacfcccac caccatcatc 3300 
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atcagggacc 
ccccctcacg 
aagatccatt 
bggattcagg 
gagctcacca 
cagggcgagg 
gagccagggc 
gagccggctg 
gatgacaacc 
atgctgctta 
aacggggccg 
aggcccatgt 
gactacgaca 
aggcccagcg 
gagctggacc 
gccagcagcg 
gcgggcggga 
ctggtgaatg 
accacgacca 
ctaagcacat 
cactcgacca 
cctcccatct 
agtcgggctc 
gctgggaggc 
gacacgccca 
caggagccgc 
ggcggtgagc 
gacctcctgc 
ggccgagagc 
tgtcccctgc 
ttcactgccc 
ggggatatcg 
gcattccggg 
aacgtgccct 
gagggcatca 
gccgggctct 
agcgccaccg 
ggctccctca 
acccttagca 



cagatgaact 
gcgaccbggg 
tcaactggct 
gtgactccga 
acctgtaccc 
gacccfcacag 
gfcctggcctt 
agaccaacgg 
gacctattgg 
ttgagaacct 
gctgggggcc 
ccatccccat 
gcttccttat 
tctccgatga 
tgcggcgcgt 
ggcgctcctc 
agggcggcag 
gccggatgga 
gtgctgctgc 
cctccaccct 
cactgccgag 
gggaacacgg 
agatgaaagg 
cagcagcgcc 
cccgcctggt 
ggtgcgagcg 
tgcatcggct 
ccaaccactc 
gtgagggtgt 
caggctccgc 
tgagcccaga 
tcggctacct 
tggatggaga 
acaagttcaa 
tcaccataga 
tccagcaccc 
agcccttcct 
cccggcatgt 
cccacatgga 



ggaccggagc 
cgccccgcag 
gcccccttct 
atccgaagcc 
gtattgcgac 
ctccctggtg 
caatgtcgtc 
tgagatcaca 
gcccatgaag 
tcgggagtcc 
tgagcgggag 
catccctgac 
gtacagcgat 
cactggctgc 
cacgtggcgg 
cgacgccgag 
cctgccccgc 
ctttgccttc 
ctatggcacc 
cacacgggac 
ggactactcc 
gaggagcagg 
gttcccccct 
ctcctggggc 
gttctctgcc 
gccgctgcag 
caacatcccc 
ctacgtgttc 
catcaccatt 
cttcactttg 
ctcgctgcag 
ggtgacctgt 
cagccccgag 
ggtgcaggcc 
gtcccaggat 
gctgcaaagc 
agtgggtccg 
gacccaggag 
ccaacagttc 



ttcacgagtc 
aaccccaatg 
ggcaagccaa 
cacctgctcg 
tatgagatga 
tcctgccgca 
tcctccacgg 
gcctacgagg 
aaagtgcfcgg 
cagccctacc 
gccatcatca 
atccctatcg 
gacgttctac 
ggctggaagt 
ctgcccccgg 
gcgccccacg 
agtgcgacac 
ccgggcagca 
cacctgagcc 
tacaactcac 
accctcacct 
cttccgctgt 
tccaggggcc 
ccagactctc 
ctggggccca 
ggctacagtg 
aaccctgccc 
cgcgtgcggg 
gaatcccagg 
agcactccca 
ctgagctggg 
gagatggccc 
agccggctga 
aggaccactg 
ggaggfcccct 
gagtacagca 
accctggggg 
tttgtgagcc 
ttccaaacfct 



agatgttgtc 
ctaaggccgc 
tggggbacag 
acagcaaggt 
aggtgbgcgc 
cccaccagga 
tgacccagct 
bctgctatgg 
ttgacaaccc 
gctacacggt 
acctggccac 
tggacgccca 
gctctccatc 
tcgagcccct 
agctcatccc 
ggcccccgga 
ccgggccccc 
ccaactccct 
cacacgtgcc 
tgacccgcfcc 
ccgtctcctc 
cctgggccct 
cacgagactc 
gcctgactgc 
catctctcag 
tggagtacca 
agacctcggt 
cccagagcca 
tgcacccgca 
gbgccccagg 
agcggccacg 
aaggaggagg 
ccgtgccggg 
agggcttcgg 
tcccgcagct 
gcatctccac 
cccagcacct 
ggacactgac 
ga 



atcacagcca 
fcgggtccagg 
ggtaaagtac 
gccctcagbg 
ctacggggcb 
agtgcccagc 
gagctgggcb 
cctggtcaac 
taagaaccgg 
gaaggcgcgc 
ccagcccaag 
gagcggggag 
gggcagccag 
gctgggggag 
gcgcctgtcg 
cgacggcggc 
cggagagcac 
gcacaggatg 
ccaccgcgtg 
agaacacbca 
ccacggcctc 
ggggtcccgg 
tataatccbg 
tggtgtgccc 
agtgagctgg 
gctgctgaac 
ggtggtggaa 
ggaaggctgg 
gagcccactg 
cccgctggbg 
gaggcccaat 
tccagccacc 
cctcagcgag 
gccagagcgc 
gggcagccgt 
cacccacacc 
ggaggcaggc 
caccagcgga 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4 560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5530 
5622 



<210> 19 
<211> 5622 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note - 
synthetic construct 

<400> 19 

atggcagggc cacggcccag cccatgggcc aggctgctcc tggcagcctt gatcagcgtc 60 

agcctctctg ggaccttgaa ccgctgcaag aaggccccaa taaagagctg cacggagtgt 120 

gtccgtgtgg ataaggactg cgcctactgc acagacgaga tgttcaggga ccggcgctgc 180 

aacacccagg cggagctgct ggccgcgggc tgccagcggg agagcatcgt ggtcatggag 240 

agcagcttcc aaatcacaga ggagacccag attgacacca ccctgcggcg cagccagatg 300 

tccccccaag gcctgcgggt ccgtctgcgg cccggtgagg agcggcattt tgagctggag 360 

gtgtttgagc cactggagag ccccgtggac ctgtacatcc tcatggactt ctccaactcc 420 

atgtccgatg atctggacaa cctcaagaag atggggcaga acctggctcg ggtcctgagc 480 

cagctcacca gcgactacac tattggattt ggcaagtttg tggacaaagt cagcgtcccg 54O 

cagacggaca tgaggcctga gaagctgaag gagccttggc ccaacagtga cccccccthc 600 
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tccttcaaga acgtcatcag cctgacagaa gatgtggatg agttccggaa taaactgcag 660 

ggagagcgga tctcaggcaa cctggatgct cctgagggcg gcttcgatgc cafccctgcag 720 

acagctgtgt gcacgaggcfa cattggctgg cgcccggaca gcacccacct gctggtcttc 780 

tccaccgagt cagccttcca ctatgaggct gatggcgcca acgtgctggc tggcatcafcg 840 

agccgcaacg airgaacggtg ccacctggac accacgggca cctacaccca gtacaggaca 900 

caggactacc cgtcggtgcc caccctggtg cgccfcgctcg ccaagcacaa catcatcccc 960 

atcfcttgctg tcaccaacta ctcctafcagc tactacgaga agcttcacac ctatttccct 1020 

gtctccfccac tgggggtgcfc gcaggaggac tcgtccaaca tcgtggagct gctggaggag 1080 

gccttcaatc ggatccgcfcc caacctggac atccgggccc tagacagccc ccgaggcctt 1140 

cggacagagg tcacctccaa gatgttccag aagacgagga ctgggtcctt tcacafcccgg 1200 

cggggggaag tgggtatata ccaggtgcag ctgcgggccc ttgagcacgt ggatgggacg 1260 

cacgtgtgcc agctgccgga ggaccagaag ggcaacatcc atctgaaacc ttccttctcc 1320 

gacggcctca agatggacgc gggcatcatc tgtgatgtgt gcacctgcga gctgcaaaaa 1380 

gaggtgcggt cagctcgctg cagcttcaac ggagacttcg tgtgcggaca gtgtgtgtgc 1440 

agcgagggct ggagtggcca gacctgcaac tgcfcccaccg gctctctgag tgacattcag 1500 

ccctgcctgc gggagggcga ggacaagccg tgctccggcc gtggggagtg ccagtgcggg 1560 

cactgtgtgt gctacggcga aggccgctac gagggtcagt tctgcgagta tgacaacttc 1620 

cagtgtcccc gcacttccgg gttcctctgc aatgaccgag gacgctgctc catgggccag 1680 

tgtgtgtgtg agcctggttg gacaggccca agctgtgact gtcccctcag caatgccacc 1740 

tgcatcgaca gcaatggggg catctgtaat ggacgtggcc actgtgagtg tggccgctgc 1800 

cactgccacc agcagtcgct ctacacggac accafcctgcg agatcaacta ctcggcgtcc 1860 

acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg ggcaccggcg 1920 

agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg gacgagctta 1980 

agagaggcga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac gactgcacct 204 0 

acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc ctggtgcaca 2100 

agaagaaggg actgccctcc gggctccttc tggtggctca tccccctgct cctcctcctc 2160 

ctgccgctcc tggccctgct actgctgcta tgctggaagt actgtgcctg ctgcaaggcc 2220 

tgcctggcac ttctcccgtg ctgcaaccga ggtcacatgg tgggctttaa ggaagaccac 2280 

tacatgctgc gggagaacct gatggcctct gaccacttgg acacgcccat gctgcgcagc 2340 

gggaacctca agggccgtga cgtggtccgc tggaaggtca ccaacaacat gcagcggcct 2400 

ggctttgcca ctcatgccgc cagcatcaac cccacagagc tggtgcccta cgggctgtcc 24 60 

ttgcgcctgg cccgcctttg caccgagaac ctgctgaagc ctgacactcg ggagtgcgcc 2520 

cagctgcgcc aggaggtgga ggagaacctg aacgaggtct acaggcagafc ctccggtgta 2580 

cacaagcfccc agcagaccaa gttccggcag cagcccaatg ccgggaaaaa gcaagaccac 2640 

accattgtgg acacagtgct gacggcgccc cgctcggcca agccggccct gctgaagctt 2700 

acagagaagc aggtggaaca gagggccttc cacgacctca aggtggcccc cggctactac 2760 

accctcactg cagaccagga cgcccggggc atggtggagt hccaggaggg cgtggagctg 2820 

gtggacgtac gggtgcccct ctttatccgg cctgaggatg acgacgagaa gcagctgctg 2880 

gtggaggcca tcgacgtgcc cgcaggcact gccaccctcg gccgccgcct ggtaaacatc 2940 

accatcatca aggagcaagc cagagacgtg gtgtcctttg agcagcctga gttctcggtc 3000 

agccgcgggg accaggtggc ccgcatccct gtcatccggc gtgtcctgga cggcgggaag 3060 

tcccaggtct cctaccgcac acaggatggc accgcgcagg gcaaccggga ctacatcccc 3120 

gtggagggtg agctgctgtt ccagcctggg gaggcctgga aagagctgca ggtgaagctc 3180 

ctggagctgc aagaagttga ctccctcctg cggggccgcc aggtccgccg tttccacgtc 324 0 

cagctcagca accctaagtt tggggcccac ctgggccagc cccactccac caccatcatc 3300 

atcagggacc cagatgaact ggaccggagc ttcacgagtc agatgttgtc atcacagcca 3360 

cccccfccacg gcgacctggg cgccccgcag aaccccaatg ctaaggccgc tgggtccagg 3420 

aagatccatt tcaactggct gcccccttct ggcaagccaa tggggtacag ggtaaagtac 3480 

tggattcagg gtgactccga atccgaagcc cacctgctcg acagcaaggt gccctcagtg 3540 

gagctcacca acctgtaccc gtattgcgac tatgagatga aggtgtgcgc ctacggggct 3600 

cagggcgagg gaccctacag ctccctggtg tcctgccgca cccaccagga agtgcccagc 3660 

gagccagggc gtctggcctt caatgtcgtc tccrccacgg tgacccagct gagctgggct 3720 

gagccggctg agaccaacgg tgagafccaca gcctacgagg tcfcgctatgg cctggtcaac 3780 

gatgacaacc gaccfcattgg gcccatgaag aaagtgctgg ttgacaaccc taagaaccgg 3840 

afcgctgctta ttgagaacct fccgggagtcc cagccctacc gctacacggt gaaggcgcgc 3900 

aacggggccg gcfcgggggcc tgagcgggag gccatcatca acctggccac ccagcccaag 3960 

aggcccatgt ccatccccafc catccctgac atccctatcg tggacgccca gagcggggag 4020 

gactacgaca gcttccfctat gtacagcgat gacgttctac gctctccatc gggcagccag 4080 

aggcccagcg tctccgatga cactggctgc ggctggaagt tcgagcccct gctgggggag 4140 

gagctggacc tgcggcgcgt cacgtggcgg ctgcccccgg agctcatccc gcgcctgtcg 4200 

gccagcagcg ggcgctcctc cgacgccgag gcgccccacg ggcccccgga cgacggcggc 4260 

gcgggcggga agggcggcag cctgccccgc agtgcgacac ccgggccccc cggagagcac 4320 
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ctggtgaatg gccggatgga cttfcgccttc ccgggcagca ccaactccct gcacaggatg 4380 

accacgacca gtgctgctgc ctatggcacc cacctgagcc cacacgtgcc ccaccgcgtg 4 440 

ctaagcacat cctccaccct cacacgggac tacaactcac tgacccgctc agaacactca 4 500 

cactcgacca cactgccgag ggactactcc accctcacch ccgfcctcctc ccacggccbc 4560 

cctcccatct gggaacacgg gaggagcagg cttccgctgt cctgggccct ggggtcccgg 4 620 

agfccgggctc agatgaaagg gttcccccct tccaggggcc cacgagactc tataatcctg 4 680 

gcfcgggaggc cagcagcgcc ctcctggggc ccagactctc gcctgactgc tggtgtgccc 4740 

gacacgccca cccgcctggt gttctctgcc cfcggggccca catctctcag agtgagctgg 4800 

caggagccgc ggtgcgagcg gccgctgcag ggctacagtg tggagtacca gctgctgaac 4860 

ggcggtgagc tgcatcggct caacatcccc aaccctgccc agacctcggfc ggtggtggaa 4 920 

gacctcctgc ccaaccactc ctacgtgttc cgcgtgcggg cccagagcca ggaaggctgg 4 980 

ggccgagagc gtgagggtgt catcaccatt gaatcccagg tgcacccgca gagcccactg 504 0 

tgtcccctgc caggctccgc cttcactttg agcactccca gtgccccagg cccgctggtg 5100 

ttcactgccc tgagcccaga ctcgctgcag ctgagctggg agcggccacg gaggcccaat 5160 

ggggatatcg tcggctacct ggtgacctgt gagatggccc aaggaggagg tccagccacc 5220 

gcattccggg tggatggaga cagccccgag agccggctga ccgtgccggg cctcagcgag 5280 

aacgtgccct acaagttcaa ggtgcaggcc aggaccactg agggcttcgg gccagagcgc 534 0 

gagggcatca tcaccataga gtcccaggat ggaggtccct tcccgcagct gggcagccgt 5400 

gccgggctct tccagcaccc gctgcaaagc gagtacagca gcatctccac cacccacacc 54 60 

agcgccaccg agcccttcct agtgggtccg accctggggg cccagcacct ggaggcaggc 5520 

ggctccctca cccggcatgt gacccaggag tttgtgagcc ggacactgac caccagcgga 5580 

acccttagca cccacatgga ccaacagttc ttccaaactt ga " " " 5622 



